Polypeptides and polynucleotides, and uses thereof as a drug target for producing drugs and biologics

ABSTRACT

This invention relates to a novel target for production of immune and non-immune based therapeutics and for disease diagnosis. More particularly, the invention provides therapeutic antibodies against VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigens, which are predicted co-stimulatory family members and which are differentially expressed in cancers including lung cancer, ovarian cancer, and colon cancer, as well as diagnostic and therapeutic uses thereof. The use of these antibodies for modulating B7 costimulation and in related therapies, such as the treatment of autoimmunity, is also provided. This invention further relates to the discovery of extracellular domains of VSIG1 and its variants, FXYD3 and its variants, ILDR1 and its variants, LOC253012 and its variants, AI216611 and its variants, and C1ORF32 and its variants, which are suitable targets for immunotherapy, cancer therapy, and drug development.

FIELD OF THE INVENTION

This invention relates to the discovery of certain proteins that are differentially expressed in specific tissues and their use as therapeutic and diagnostic targets. More specifically, the invention relates to a protein VSIG1 and its variants, FXYD3 and its variants, ILDR1 and its variants, LOC253012 and its variants, AI216611 and its variants, and C1ORF32 and its variants, which are differentially expressed by some cancers, and therefore are suitable targets for immunotherapy, cancer therapy, and drug development. This invention further relates to the discovery of extracellular domains of VSIG1 and its variants, FXYD3 and its variants, ILDR1 and its variants, LOC253012 and its variants, AI216611 and its variants, and C1ORF32 and its variants, which are suitable targets for immunotherapy, cancer therapy, and drug development.

Additionally, because some of the proteins of this invention, based on their B7-like structure, are believed to play a role in immune costimulation, the invention further relates to the use of these proteins, or drugs which modulate these proteins (agonistic and antagonistic), as immune modulators and for immune therapy, especially for treating cancer and immune related disorders such as autoimmune disorders. Also, the invention more specifically relates to therapeutic and diagnostic antibodies and antibody fragments that specifically bind to proteins of the invention or a soluble or secreted portion thereof, especially the ectodomain, and to therapies and diagnostic methods using same.

BACKGROUND OF THE INVENTION

Tumor antigens are ideally positioned as biomarkers and drug targets, and they play a critical role in the development of novel strategies for active and passive immunotherapy agents, to be used as stand-alone therapies or in conjunction with conventional therapies for cancer. Tumor antigens can be classified as either tumor-specific antigens (TSAs) where the antigens are expressed only in tumor cells and not in normal tissues, or tumor-associated antigens (TAAs) where the antigens are overexpressed in tumor cells but nonetheless also present at low levels in normal tissues.

TAAs and TSAs are validated as targets for passive (antibody) therapy as well as active immunotherapy using strategies to break immune tolerance and stimulate the immune system. The antigenic epitopes that are targeted by these therapeutic approaches are present at the cell surface, overexpressed in tumor cells compared to non-tumor cells, and are targeted by antibodies that block functional activity, inhibit cell proliferation, or induce cell death.

There is a growing number of tumor-associated antigens against which monoclonal antibodies have been tested or are in use as treatment for cancer. The identification and molecular characterization of novel tumor antigens expressed by human malignancies is an active field in tumor immunology. Several approaches have been used to identify tumor-associated antigens as target candidates for immunotherapy, including high throughput bioinformatic approaches, based on genomics and proteomics. The identification of novel TAAs or TSAs expands the spectrum of tumor antigen targets available for immune recognition and provides new target molecules for the development of therapeutic agents for passive immunotherapy, including monoclonal antibodies, whether unmodified or armed. Such novel antigens may also point the way to more effective therapeutic vaccines for active or adoptive immunotherapy.

Cancer vaccination involves the administration of tumor antigens and is used to break immune tolerance and induce an active T-cell response to the tumor. Vaccine therapy includes the use of naked DNA, peptides, recombinant protein, and whole cell therapy, where the patient's own tumor cells are used as the source of the vaccine. With the identification of specific tumor antigens, vaccinations are more often carried out by dendritic cell therapy, whereby dendritic cells are loaded with the relevant protein or peptide, or transfected with vector DNA or RNA.

The major applications of anti-TAA antibodies for treatment of cancer are therapy with naked antibody, therapy with a drug-conjugated antibody, and fusion therapy with cellular immunity. Ever since their discovery, antibodies were envisioned as “magic bullets” that would deliver toxic agents, such as drugs, toxins, enzymes and radioisotopes, specifically to the diseased site while leaving the non-target normal tissues unaffected. Indeed, antibodies, and in particular antibody fragments, can function as carriers of cytotoxic substances such as radioisotopes, drugs and toxins. Immunotherapy with such immunoconjugates is more effective than with the naked antibody.

In contrast to the overwhelming success of naked (such as Rituxan and Campath) and conjugated antibodies (such as Bexxar and Zevalin) in treating hematological malignancies, only modest success has been achieved in the immunotherapy of solid tumors. One of the major limitations in successful application of immunotherapy to solid tumors is the large molecular size of the intact immunoglobulin, which results in prolonged serum half-life but poor tumor penetration and uptake. Indeed, only a very small amount of administered antibody (as low as 0.01%) reaches the tumor. In addition to their size, antibodies encounter other impediments before reaching their target antigens expressed on the cell surface of solid tumors. Some of the barriers include poor blood flow in large tumors, permeability of vascular endothelium, elevated interstitial fluid pressure of tumor stroma, and heterogeneous antigen expression.

With the advent of antibody engineering, small molecular weight antibody fragments exhibiting improved tumor penetration have been generated. Such antibody fragments are often conjugated to specific cytotoxic molecules and are designed to selectively deliver them to cancer cells. Still, solid tumors remain a formidable challenge for therapy, even with immunoconjugated antibody fragments.

The new wave of optimization strategies involves the use of biological modifiers to modulate the impediments posed by solid tumors. Thus, in combination with antibodies or their conjugated antibody fragments, various agents are being used to improve the tumor blood flow, enhance vascular permeability, lower tumor interstitial fluid pressure by modulating stromal cells and extracellular matrix components, upregulate expression of target antigens and improve penetration and retention of the therapeutic agent.

Immunotherapy with antibodies represents an exciting opportunity for combining with standard modalities, such as chemotherapy, as well as combinations with diverse biological agents to obtain a synergistic activity. Indeed, unconjugated mAbs are more effective when used in combination with other therapeutic agents, including other antibodies.

Another component of the immune system response to immunotherapy is the cellular response, specifically the T cell response and activation of cytotoxic T cells (CTLs). The efficiency of the immune system in mediating tumor regression depends on the induction of antigen-specific T-cell responses through physiologic immune surveillance, priming by vaccination, or following adoptive transfer of T-cells. Although a variety of tumor-associated antigens have been identified and many immunotherapeutic strategies have been tested, objective clinical responses are rare. The reasons for this include the inability of current immunotherapy approaches to generate efficient T-cell responses, the presence of regulatory cells that inhibit T-cell responses, and other escape mechanisms that tumors develop, such as inactivation of cytolytic T-cells through expression of negative costimulatory molecules. Effective immunotherapy for cancer will require the use of appropriate tumor-specific antigens; the optimization of the interaction between the antigenic peptide, the APC and the T cell; and the simultaneous blockade of negative regulatory mechanisms that impede immunotherapeutic effects.

T-cell activation plays a central role in driving both protective and pathogenic immune responses, and it requires the completion of a carefully orchestrated series of specific steps that can be preempted or disrupted by any number of critical events. Naïve T cells must receive two independent signals from antigen-presenting cells (APC) in order to become productively activated. The first, Signal 1, is antigen-specific and occurs when T cell antigen receptors encounter the appropriate antigen-MHC complex on the APC. A second, antigen-independent signal (Signal 2) is delivered through a T cell costimulatory molecule that engages its APC-expressed ligand. In the absence of a costimulatory signal, T-cell activation is impaired or aborted, which may lead to a state of antigen-specific unresponsiveness (known as T-cell anergy), or may result in T-cell apoptotic death.

Costimulatory signals can be either stimulatory (positive costimulation) or inhibitory (negative costimulation or coinhibition). Positive costimulation is required for optimal activation of naïve T cells, while negative costimulation is required for the acquisition of immunologic tolerance to self, as well as the termination of effector T cell functions. Costimulatory signals, particularly positive costimulatory signals, also play a role in the modulation of B cell activity. For example, B cell activation and the survival of germinal center B cells require T cell-derived signals in addition to stimulation by antigen.

Both positive and negative costimulatory signals play critical roles in the regulation of cell-mediated immune responses, and molecules that mediate these signals have proven to be effective targets for immunomodulation. Based on this knowledge, several therapeutic approaches that involve targeting of costimulatory molecules have been developed, and were shown to be useful for prevention and treatment of cancer and autoimmune diseases, as well as rejection of allogeneic transplantation, each by turning on, or preventing the turning off, of immune responses in subjects with these pathological conditions.

Costimulatory molecule pairs usually consist of ligands expressed on APCs and their cognate receptors expressed on T cells. The well characterized B7/CD28 and CD40/CD40L costimulatory molecules are critical in primary T-cell activation. In recent years, several additional costimulatory molecules have been identified that belong to the B7/CD28 or the TNF/TNF-R gene families. The effects of costimulatory TNFR family members can often be functionally, temporally, or spatially segregated from those of CD28 family members and from each other. The sequential and transient regulation of T cell activation/survival signals by different costimulators may function to allow longevity of the response while maintaining tight control of T cell survival.

The B7 family consists of structurally related, cell-surface protein ligands, which bind to receptors on lymphocytes that regulate immune responses. Interaction of B7-family members with their respective costimulatory receptor, usually a member of the CD28-related family, augments immune responses, while interaction with coinhibitory receptors, such as CTLA4, attenuates immune responses. Members of the B7 family share 20-40% amino-acid identity and are structurally related, with the extracellular domain containing tandem domains related to variable and constant immunoglobulin domains.

There are currently seven known members of the family: B7.1 (CD80), B7.2 (CD86), B7-H1 (PD-L1), B7-H2 (ICOS-L), B7-DC (PD-L2), B7-H3, and B7-H4, each with unique, yet often overlapping functions. Clearly, each B7 molecule has developed its own indispensable niche in the immune system. As specific niches of B7 family members continue to be dissected, their diagnostic and therapeutic potential becomes ever more apparent. Many of the B7 superfamily members were initially characterized as T cell costimulatory molecules. However, more recently it has become clear they can also coinhibit T cell responses. Thus, B7 family members may have opposing effects on an immune response.

Central to the normal function of the immune system is its ability to distinguish between self and non-self, since failure to do so could provoke the onset of autoimmune disease. Most autoimmune disorders are known to involve autoreactive T cells and/or autoantibodies. Thus, agents that are capable of inhibiting or eliminating autoreactive lymphocytes have a promising therapeutic potential. Furthermore, the use of agents that exhibit such immunosuppressive activity should also be beneficial in order to inhibit normal immune responses to alloantigens in patients receiving a transplant. Thus, novel agents that are capable of modulating costimulatory signals, without compromising the immune system's ability to defend against pathogens, are highly advantageous for treatment and prevention of such pathological conditions.

The importance of the B7 family members in regulating immune responses to self and allo-antigens was demonstrated by the development of immunodeficiency and autoimmune diseases in mice with mutations in B7-family genes. Accordingly, manipulation of the signals delivered by B7 ligands has shown potential in the treatment of autoimmunity, inflammatory diseases, and transplant rejection. This approach relies, at least partially, on the eventual deletion of auto- or allo-reactive T cells, presumably because in the absence of costimulation (which induces cell survival genes) T cells become highly susceptible to induction of apoptosis.

Harnessing the immune system to treat chronic diseases is a major goal of immunotherapy. Active and passive immunotherapies are proving themselves as effective therapeutic strategies. Passive immunotherapy, using monoclonal antibodies or receptor Fc-fusion proteins, has come of age and has shown great clinical success. A growing number of such therapeutic agents have been approved or are in clinical trials to prevent allograft rejection or to treat autoimmune diseases and cancer. Active immunotherapy (i.e. vaccines) has been effective against agents that normally cause acute self-limiting infectious diseases followed by immunity and has been at the forefront of efforts to prevent the infectious diseases that plague humankind. However, active immunotherapy has been much less effective against cancer or chronic infectious diseases primarily because these have developed strategies to escape normal immune responses. Among these are negative costimulators of the B7 family, such as B7-H1 and B7-H4, which are highly expressed in certain tumors, and afford local protection from immune cell-mediated attack.

Costimulators of the B7 family play a critical role in activation and inhibition of antitumor immune responses. Novel agents targeting these molecules could find significant use in the modulation of immune responses and the improvement of cancer immunotherapy. Such agents could be administered in conjunction with tumor-specific antigens, as an adjuvant that serves to enhance the immune response to the antigen in the patient. In addition, such agents could be of use in other types of cancer immunotherapy, such as adoptive immunotherapy, in which tumor-specific T cell populations are expanded and directed to attack and kill tumor cells. Agents capable of augmenting such anti-tumor response have great therapeutic potential and may be of value in the attempt to overcome the obstacles to tumor immunotherapy.

Passive tumor immunotherapy uses the exquisite specificity and lytic capability of the immune system to target tumor specific antigens and treat malignant disease with a minimum of damage to normal tissue. Several approaches have been used to identify tumor-associated antigens as target candidates for immunotherapy. The identification of novel tumor specific antigens expands the spectrum of tumor antigen targets available for immune recognition and provides new target molecules for the development of therapeutic agents for passive immunotherapy, including monoclonal antibodies, whether unmodified or armed. Such novel antigens may also point the way to more effective therapeutic vaccines for active or adoptive immunotherapy.

Clinical development of costimulation blockade came to fruition with the approval of CTLA4Ig (abatacept) for rheumatoid arthritis. This soluble fusion protein, which acts as a competitive inhibitor of the B7/CD28 costimulatory pathway, is also in clinical trials for other immune diseases such as psoriasis and multiple sclerosis, and for transplant rejection. Promising results have also been obtained in a phase II clinical trial in kidney transplantation with belatacept, a re-engineered CTLA4Ig with enhanced binding affinity to its ligands, B7.1 and B7.2 (CD80 and CD86, respectively). Two fully human anti-CTLA4 monoclonal antibodies, ipilimumab and tremelimumab, abrogate the CTLA4/B7 inhibitory interaction, and are in phase III clinical trials for metastatic melanoma and other cancers, as well as HIV infection. Galiximab is a primatized monoclonal antibody targeting CD80, in phase II for rheumatoid arthritis, psoriasis and non-Hodgkin's lymphoma.

It is important to point out that strategies that use single agents to block costimulation have often proved to be insufficient. Given the diversity of the different costimulation molecules, future strategies may involve the simultaneous blockade of several selected pathways or combination therapy with conventional drugs, such as immunosuppressants for immune-related disorders or cytotoxic drugs for cancer.

Despite recent progress in the understanding of cancer biology and cancer treatment, as well as better understanding of the molecules involved in immune responses, the success rate for cancer therapy and for the treatment of autoimmune diseases remains low. Therefore, there is an unmet need for new therapies which can successfully treat both cancer and autoimmune disorders.

BRIEF SUMMARY OF THE INVENTION

It is an object of the invention to provide novel therapeutic and diagnostic compositions containing at least one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 proteins or one of the novel splice variants disclosed herein, as well as to provide these novel splice variants, specifically VSIG1 splice variants, ILDR1 splice variants, LOC253012 splice variants, AI216611 splice variants, C1ORF32 splice variants, and FXYD3 splice variants, and nucleic acid sequences encoding same or fragments thereof, especially the ectodomain or secreted forms of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants.

It is another object of the invention to use said proteins, splice variants and nucleic acid sequences as novel targets for development of drugs which specifically bind to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants, and/or drugs which agonize or antagonize the binding of other moieties to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants.

It is still another object of the invention to provide drugs which modulate (agonize or antagonize) at least one VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 related biological activity. Such drugs include, by way of example, antibodies, small molecules, peptides, ribozymes, antisense molecules, siRNAs and the like. These molecules may directly bind or modulate an activity elicited by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 DNA or portions or variants thereof or may indirectly modulate a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 associated activity or binding of molecules to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 and portions and variants thereof such as by modulating the binding of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 to its counterreceptor or endogenous ligand.

In more specific embodiments, the present invention provides novel splice variants of a known protein V-set and immunoglobulin domain containing 1 (SEQ ID NO:11) (RefSeq accession identifier NP_872413, synonyms: RP5-889N15.1, 1700062D20Rik, GPA34, MGC44287, dJ889N15.1) or a polynucleotide encoding same, which can be used as diagnostic markers and/or therapeutic agents which agonize or antagonize the binding of other moieties to the VSIG1 proteins and/or which modulate (agonize or antagonize) at least one VSIG1 related biological activity.

According to one more specific embodiment, the novel splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in any one of AI581519_T10 (SEQ ID NO:9), AI581519_T11 (SEQ ID NO:10), or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95% homologous to any one of AI581519_T10 (SEQ ID NO:9), AI581519_T11 (SEQ ID NO:10).

According to yet another more specific embodiment, the novel splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in any one of AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), or a sequence homologous thereto. According to another embodiment, the isolated polypeptide is at least 95% homologous to any one of AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16).

It is another specific object of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the VSIG1 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in immunotherapy (promoting or inhibiting immune costimulation).

In more specific embodiments the present invention provides discrete portions of the VSIG1 proteins including different portions of the extracellular domain corresponding to residues 23-234 of the VSIG1 protein sequence contained in the sequence of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or residues 23-270 of the VSIG1 protein sequence contained in the sequence of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or residues 23-296 of the VSIG1 protein sequence contained in the sequence of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or residues 23-193 of the VSIG1 protein sequence contained in the sequence of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or residues 23-203 of the VSIG1 protein sequence contained in the sequence of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or residues 23-231 of the VSIG1 protein sequence contained in the sequence of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of the VSIG1 protein sequence contained in the sequence of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302, or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.
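
By way of non-limiting illustration only, the residue ranges recited above can be read as 1-based, inclusive positions within the corresponding full-length protein sequence. The following Python sketch shows one way such a portion might be excised from a sequence string. The helper names (extract_residues, portion_length) and the placeholder sequence are hypothetical and are not part of the disclosed sequences; the actual sequences are those set forth in the recited SEQ ID NOs.

```python
# Illustrative sketch only. Assumes residue ranges are 1-based and inclusive.
# Function names and the placeholder sequence are hypothetical.

# (variant name, first residue, last residue, SEQ ID NO of the excised portion),
# as recited in the paragraph above.
VSIG1_ECD_PORTIONS = [
    ("AI581519_P3", 23, 234, 138),
    ("AI581519_P4", 23, 270, 139),
    ("AI581519_P5", 23, 296, 140),
    ("AI581519_P7", 23, 193, 141),
    ("AI581519_P9", 23, 203, 142),
    ("AI581519_P10", 23, 231, 143),
    ("AI581519_P5", 26, 293, 302),
]


def extract_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if start < 1 or end < start or end > len(sequence):
        raise ValueError(f"range {start}-{end} is outside a sequence of length {len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


def portion_length(start: int, end: int) -> int:
    """Number of residues in an inclusive range."""
    return end - start + 1


if __name__ == "__main__":
    # Placeholder only: NOT a real VSIG1 sequence.
    placeholder = "M" + "A" * 299
    for name, start, end, seq_id in VSIG1_ECD_PORTIONS:
        portion = extract_residues(placeholder, start, end)
        print(f"{name} residues {start}-{end} (SEQ ID NO:{seq_id}): {len(portion)} residues")
```

Under this reading, residues 23-234 span 212 amino acids. Whether residue numbering includes the signal peptide is taken here from the recited full-length sequences and is otherwise an assumption of the sketch.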

According to other more specific embodiments, the present invention provides novel splice variants of a known protein immunoglobulin-like domain containing receptor 1 (SEQ ID NO:21) (RefSeq accession identifier NP_787120, also known as ILDR1alpha, ILDR1beta, ILDR1), or a polynucleotide encoding same, which can be used as diagnostic markers and/or therapeutic agents which agonize or antagonize the binding of other moieties to the ILDR1 proteins and/or which modulate (agonize or antagonize) at least one ILDR1 related biological activity.

In one specific embodiment, the novel splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in AA424839_1_T7 (SEQ ID NO:20), or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95, 96, 97, 98 or 99% homologous to AA424839_1_T7 (SEQ ID NO:20).

According to yet another specific embodiment, the novel splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in AA424839_1_P11 (SEQ ID NO:24), or a sequence homologous thereto, i.e., which possesses at least 80 or 90% sequence identity therewith. According to another related embodiment, the isolated polypeptide is at least 95, 96, 97, 98 or 99% homologous to AA424839_1_P11 (SEQ ID NO:24).

It is another embodiment of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the ILDR1 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in immunotherapy (promoting or inhibiting immune costimulation).

According to yet further embodiments the present invention provides discrete portions of the ILDR1 proteins including different portions of the extracellular domain corresponding to residues 24-162 of sequences AA424839_P3 (SEQ ID NO:22) and AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, or residues 24-457 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, or residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301, or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.
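
By way of non-limiting illustration only, the sequence identity thresholds recited above (at least 80%, at least 90%, and at least 95, 96, 97, 98 or 99%) may be evaluated once a candidate variant has been aligned to a reference portion. The Python sketch below assumes the two sequences have already been aligned by any suitable alignment program and that gaps are written as "-". It uses one common convention, namely identical positions divided by the number of alignment columns; other conventions exist, and this text does not prescribe one. The function names and example sequences are hypothetical.

```python
# Illustrative sketch only. Assumes pre-aligned sequences of equal length,
# with "-" marking gaps. Identity = identical columns / alignment columns.
# This is one convention among several; names and inputs are hypothetical.

IDENTITY_THRESHOLDS = (80, 90, 95, 96, 97, 98, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Return the highest recited threshold satisfied, or None if below 80%."""
    met = [t for t in IDENTITY_THRESHOLDS if identity >= t]
    return max(met) if met else None


if __name__ == "__main__":
    # Toy sequences, not taken from any SEQ ID NO.
    reference = "ACDEFGHIKL"
    candidate = "ACDEFGHIKV"
    identity = percent_identity(reference, candidate)
    print(identity, highest_threshold_met(identity))
```

A candidate that differs from a ten-residue reference at one position has 90% identity under this convention and therefore meets the 80% and 90% thresholds but not the 95% threshold.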

It is another embodiment of the invention to provide an isolated or purified soluble protein or nucleic acid sequence having or encoding the extracellular domain of the ILDR1 protein which optionally may be directly or indirectly attached to a non-ILDR1 protein or nucleic acid sequence such as a soluble immunoglobulin domain or fragment.

According to certain embodiments, the present invention provides novel splice variants of a known hypothetical protein LOC253012 isoform 1 (SEQ ID NO:35) (RefSeq accession identifier NP_001034461) or a polynucleotide encoding same, and their use as diagnostic markers and/or as therapeutic agents which agonize or antagonize the binding of other moieties to the LOC253012 proteins and/or which modulate (agonize or antagonize) at least one LOC253012 related biological activity.

According to one embodiment, the novel LOC253012 splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in any one of H68654_1_T8 (SEQ ID NO:28), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), or H68654_1_T20 (SEQ ID NO:34) or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95% homologous to any one of H68654_1_T8 (SEQ ID NO:28), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), or H68654_1_T20 (SEQ ID NO:34).

According to yet another embodiment, the novel LOC253012 splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in any one of H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40) or a sequence homologous thereto. According to another embodiment, the isolated polypeptide is at least 95, 96, 97, 98 or 99% homologous to any one of H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40).

It is another object of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the LOC253012 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in immunotherapy (promoting or inhibiting immune costimulation).

According to yet further embodiments of the present invention there are discrete portions of the LOC253012 proteins including different portions of the extracellular domain corresponding to residues 38-349 of the sequence H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of the sequences H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequence H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

It is another object of the invention to provide an isolated or purified soluble protein or nucleic acid sequence having or encoding the extracellular domain of the LOC253012 protein which optionally may be directly or indirectly attached to a non-LOC253012 protein or nucleic acid sequence such as a soluble immunoglobulin domain or fragment.

According to certain embodiments, the present invention provides novel splice variants of AI216611, or a polynucleotide encoding same, which can be used as diagnostic markers and/or therapeutic agents which agonize or antagonize the binding of other moieties to the AI216611 proteins and/or which modulate (agonize or antagonize) at least one AI216611 related biological activity.

According to one embodiment, the novel AI216611 splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in AI216611_T1 (SEQ ID NO:42), or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95, 96, 97, 98 or 99% homologous to AI216611_T1 (SEQ ID NO:42).

According to yet another embodiment, the novel AI216611 splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in AI216611_P1 (SEQ ID NO:44) or a sequence homologous thereto. According to another embodiment, the isolated polypeptide is at least 95, 96, 97, 98 or 99% homologous to AI216611_P1 (SEQ ID NO:44).

It is another object of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the AI216611 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in immunotherapy (such as promoting or inhibiting immune costimulation). According to yet further embodiments of the present invention there are discrete portions of the AI216611 proteins including different portions of the extracellular domain corresponding to residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

It is another object of the invention to provide an isolated or purified soluble protein or nucleic acid sequence having or encoding the extracellular domain of the AI216611 protein which optionally may be directly or indirectly attached to a non-AI216611 protein or nucleic acid sequence such as a soluble immunoglobulin domain or fragment.

It is another object of the invention to provide vectors such as plasmids and recombinant viral vectors and host cells containing the vectors that express AI216611, its secreted or soluble form and/or the ECD of the AI216611 protein and variants thereof or polypeptide conjugates containing any of the foregoing.

According to certain embodiments, the present invention provides novel splice variants of a known hypothetical protein LOC387597 (SEQ ID NO:47) (RefSeq accession identifier NP_955383, synonyms: NP_955383; LISCH-like; C1ORF32; RP4-782G3.2; dJ782G3.1) or a polynucleotide encoding same, which can be used as diagnostic markers and/or therapeutic agents which agonize or antagonize the binding of other moieties to the C1ORF32 proteins and/or which modulate (agonize or antagonize) at least one C1ORF32 related biological activity.

According to one embodiment, the novel LOC387597 splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in any one of H19011_1_T8 (SEQ ID NO:45), H19011_1_T9 (SEQ ID NO:46), or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95, 96, 97, 98 or 99% homologous to any one of H19011_1_T8 (SEQ ID NO:45), H19011_1_T9 (SEQ ID NO:46).

According to yet another embodiment, the novel LOC387597 splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in any one of H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50) or a sequence homologous thereto. According to another embodiment, the isolated polypeptide is at least 95, 96, 97, 98 or 99% homologous to any one of H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50).

It is another object of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the C1ORF32 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in immunotherapy (promoting or inhibiting immune costimulation).

According to yet further embodiments of the present invention there are discrete portions of the C1ORF32 proteins including different portions of the extracellular domain corresponding to residues 21-186 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of the sequence H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94) or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

It is another object of the invention to provide an isolated or purified soluble protein or nucleic acid sequence having or encoding the extracellular domain of the C1ORF32 protein which optionally may be directly or indirectly attached to a non-C1ORF32 protein or nucleic acid sequence such as a soluble immunoglobulin domain or fragment.

According to certain embodiments, the present invention provides novel splice variants of the known protein FXYD3, FXYD domain-containing ion transport regulator 3 precursor (SEQ ID NO:70) (SwissProt accession identifier FXYD3_HUMAN; known also according to the synonyms Chloride conductance inducer protein Mat-8; Mammary tumor 8 kDa protein; Phospholemman-like) or a polynucleotide encoding same, which can be used as diagnostic markers and/or therapeutic agents which agonize or antagonize the binding of other moieties to the FXYD3 proteins and/or which modulate (agonize or antagonize) at least one FXYD3 related biological activity.

According to one embodiment, the novel FXYD3 splice variant is an isolated polynucleotide comprising a nucleic acid having a nucleic acid sequence as set forth in any one of R31375_T19 (SEQ ID NO:65), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T39 (SEQ ID NO:69), or a sequence homologous thereto. According to another embodiment, the isolated polynucleotide is at least 95, 96, 97, 98 or 99% homologous to any one of R31375_T19 (SEQ ID NO:65), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T39 (SEQ ID NO:69).

According to yet another embodiment, the novel FXYD3 splice variant is an isolated protein or polypeptide having an amino acid sequence as set forth in any one of R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74) or a sequence homologous thereto. According to another embodiment, the isolated polypeptide is at least 95, 96, 97, 98 or 99% homologous to any one of R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74).

It is another object of the invention to provide molecules and isolated polypeptides comprising the soluble ectodomain (ECD) of the FXYD3 proteins and fragments thereof as well as nucleic acid sequences encoding said soluble ectodomain, as well as fragments thereof and conjugates and the use thereof as therapeutics including their use in cancer immunotherapy.

According to yet further embodiments of the present invention there are discrete portions of the FXYD3 proteins including different portions of the extracellular domain corresponding to residues 21-36 of the sequence R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of the sequence R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297, or variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.
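The identity thresholds recited above (at least 80%, 90%, or 95 to 99%) presuppose some way of scoring two sequences against each other, which this section does not specify. The following is a minimal sketch of one common convention, assuming the two sequences have already been aligned to equal length with `-` as the gap character, and counting identical residues over all alignment columns. The function names `percent_identity` and `meets_threshold` and the short example strings are hypothetical; other conventions (for example, dividing by the length of the shorter sequence) give different values.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap; they carry no information.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    # A column counts as a match only if both residues are equal and not a gap.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-residue placeholders, NOT sequences disclosed in this document.
reference = "MKTLLVAGAL"
variant = "MKTLLVSGAL"  # one substitution at position 7

identity = percent_identity(reference, variant)
print(identity, meets_threshold(reference, variant, 90.0))
```

In this sketch a single substitution in a 10-residue fragment leaves 9 of 10 columns identical, so the pair would satisfy the 80% and 90% thresholds but not the 95% threshold.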

It is another object of the invention to provide an isolated or purified soluble protein or nucleic acid sequence having or encoding the extracellular domain of any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 proteins which optionally may be directly or indirectly attached to a non-VSIG1, non-ILDR1, non-LOC253012, non-AI216611, non-C1ORF32 or non-FXYD3 protein or nucleic acid sequence, respectively, such as a soluble immunoglobulin domain or fragment.

It is another object of the invention to provide molecules and isolated polypeptides comprising an edge portion, tail or head portion of any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 novel variants of the invention, or a homologue or a fragment thereof as well as nucleic acid sequences encoding said edge portion, tail or head portion, as well as fragments thereof and conjugates and the use thereof as therapeutics and/or for diagnostics.

It is a further object of the invention to provide molecules and isolated polypeptides comprising a bridge, edge portion, tail or head portion, as depicted in any one of SEQ ID NOs:284-295, or a homologue or a fragment thereof as well as nucleic acid sequences encoding said bridge, edge portion, tail or head portion, as well as fragments thereof and conjugates and the use thereof as therapeutics and/or for diagnostics.

It is another object of the invention to provide vectors such as plasmids and recombinant viral vectors and host cells containing the vectors that express any one of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, its secreted or soluble form and/or the ECD of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 protein and variants thereof or polypeptide conjugates containing any of the foregoing.

It is another object of the invention to use these vectors such as plasmids and recombinant viral vectors and host cells containing the vectors that express any one of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, its secreted or soluble form and/or the ECD of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 protein and variants thereof or polypeptide conjugates containing any of the foregoing to produce said VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 protein, fragments or variants thereof and/or conjugates containing any one of the foregoing.

It is another object of the invention to provide pharmaceutical or diagnostic compositions containing any of the foregoing.

It is another object of the invention to provide and use compounds including VSIG1 ectodomain or fragments or variants thereof, which are suitable for treatment or prevention of cancer, autoimmune disorders, transplant rejection, graft versus host disease, and/or for blocking or promoting immune costimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 polypeptide.

It is a specific object of the invention to develop novel monoclonal or polyclonal antibodies and antibody fragments and conjugates containing same that specifically bind the full length VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, selected from the group consisting of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), its secreted form and/or the ECD thereof or conjugates or fragments thereof. These antibodies are potentially useful as therapeutics and/or diagnostic agents (both in vitro and in vivo diagnostic methods). Included in particular are antibodies and fragments that are immune activating or immune suppressing such as antibodies or fragments that target cells via ADCC (antibody dependent cellular cytotoxicity) or CDC (complement dependent cytotoxicity) activities.

It is another object of the invention to provide diagnostic methods that include the use of any of the foregoing including by way of example immunohistochemical assay, radioimaging assays, in-vivo imaging, radioimmunoassay (RIA), ELISA, slot blot, competitive binding assays, fluorimetric imaging assays, Western blot, FACS, and the like. In particular this includes assays which use chimeric or non-human antibodies or fragments that specifically bind the intact VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 protein, selected from the group consisting of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), its soluble form, its ECD, and/or conjugates, fragments or variants thereof.

It is another object of the invention to use novel therapeutically effective polyclonal or monoclonal antibodies against any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, selected from the group consisting of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), and fragments, conjugates, and variants thereof for treating conditions wherein the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or its secreted or soluble form or ECD and/or portions or variants thereof are differentially expressed including various cancers and malignancies including non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer is non-metastatic, invasive or metastatic.

It is another object of the invention to use novel therapeutically effective polyclonal or monoclonal antibodies against any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, selected from the group consisting of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), and fragments, conjugates and variants thereof for treating non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

It is a specific object of the invention to use antibodies and antibody fragments against VSIG1 antigen, its secreted or soluble form or ECD and/or variants, conjugates, or fragments thereof for treating and diagnosing lung cancer and/or ovarian cancer, wherein this antigen is differentially expressed.

It is a specific embodiment of the invention to use antibodies and antibody fragments against ILDR1 antigen, its secreted or soluble form or ECD and/or variants, conjugates, or fragments thereof for treating and diagnosing colon and/or ovarian cancers, wherein this antigen is differentially expressed.

It is a specific object of the invention to use antibodies and antibody fragments against LOC253012 or C1ORF32 antigen, its secreted or soluble form or ECD and/or variants, conjugates, or fragments thereof for treating and diagnosing lung cancer, particularly small cell lung carcinoma, wherein this antigen is differentially expressed.

It is a specific object of the invention to use antibodies and antibody fragments against AI216611 antigen, its secreted or soluble form or ECD and/or variants, conjugates, or fragments thereof for treating and diagnosing colon cancer, wherein this antigen is differentially expressed.

It is a specific object of the invention to use antibodies and antibody fragments against FXYD3 wild type antigen (R31375_P0 (SEQ ID NO:70)), or antibodies and antibody fragments against its secreted or soluble form or ECD and conjugates containing same for treating and diagnosing ovarian cancer, wherein this antigen is differentially expressed.

It is another object of the invention to use antibodies and antibody fragments, and conjugates containing, against the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, selected from the group consisting of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74) in modulating (enhancing or inhibiting) immunity including antibodies that activate or suppress the immune co-stimulation in particular B7 related immune costimulation and are capable of treating related therapeutic applications, through positive stimulation of T cell activity against cancer cells, and negative stimulation of T cell activity for the treatment of autoimmunity and other immune disorders.

It is another specific object of the invention to produce antibodies and antibody fragments against discrete portions of the VSIG1 proteins including different portions of the extracellular domain corresponding to residues 23-234 of the VSIG1 protein sequence contained in the sequence of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or residues 23-270 of the VSIG1 protein sequence contained in the sequence of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or residues 23-296 of the VSIG1 protein sequence contained in the sequence of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or residues 23-193 of the VSIG1 protein sequence contained in the sequence of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or residues 23-203 of the VSIG1 protein sequence contained in the sequence of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or residues 23-231 of the VSIG1 protein sequence contained in the sequence of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of the VSIG1 protein sequence contained in the sequence of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302.

It is another specific embodiment of the invention to produce antibodies and antibody fragments against discrete portions of the ILDR1 proteins including different portions of the extracellular domain corresponding to residues 24-162 of sequences AA424839_P3 (SEQ ID NO:22) and AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, residues 24-457 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, and residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301 of the ILDR1 protein sequences disclosed herein.

It is another specific object of the invention to produce antibodies and antibody fragments against discrete portions of the LOC253012 proteins including different portions of the extracellular domain corresponding to residues 38-349 of the sequence H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of the sequences H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequence H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, of the LOC253012 protein sequences disclosed herein.

It is another specific object of the invention to produce antibodies and antibody fragments against discrete portions of the AI216611 proteins including different portions of the extracellular domain corresponding to residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, of the AI216611 protein sequences disclosed herein.

It is another specific object of the invention to produce antibodies and antibody fragments against discrete portions of the C1ORF32 proteins including different portions of the extracellular domain corresponding to residues 21-186 of the C1ORF32 protein sequence contained in the sequence of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of the C1ORF32 protein sequence contained in the sequence of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94).

It is another specific object of the invention to produce antibodies and antibody fragments against discrete portions of the FXYD3 proteins including different portions of the extracellular domain corresponding to residues 21-36 of the FXYD3 protein sequence contained in the sequence of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of the FXYD3 protein sequence contained in the sequence of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of the FXYD3 protein sequence contained in the sequence of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297.

It is a specific object of the invention to provide polyclonal and monoclonal antibodies and fragments thereof or an antigen binding fragment thereof comprising an antigen binding site that binds specifically to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 proteins, their soluble forms, the ECD thereof and/or variants and fragments thereof.

It is a specific object of the invention to use such antibodies and fragments thereof for treatment or prevention of cancer and/or for modulating (activating or blocking) the activity of the target in the immune co-stimulatory system.

It is a related object of the invention to select monoclonal and polyclonal antibodies and fragments thereof against VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 which are suitable for treatment or prevention of autoimmune disorders, transplant rejection, GVHD, and/or for blocking or enhancing immune costimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptide.

It is a specific object of the invention to use antibodies against any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, soluble form, ECD or fragment or variant thereof for the treatment and diagnosis of cancers including by way of example lung cancer, ovarian cancer, colon cancer, as well as other non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer is non-metastatic, invasive or metastatic.

With regard to lung cancer, the disease is selected from the group consisting of squamous cell lung carcinoma, lung adenocarcinoma, carcinoid, small cell lung cancer or non-small cell lung cancer.

It is another object of the invention to provide and use antibodies and antibody fragments against any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, its soluble form, or ECD and variants or fragments thereof as well as soluble polypeptides containing the ectodomain of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or a portion thereof which are useful for immune modulation, including treatment of autoimmunity and preferably for treating an autoimmune disease selected from autoimmune diseases: Multiple sclerosis; Psoriasis; Rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

It is another object of the invention to provide and use compounds including drugs such as small molecules, peptides, antibodies and fragments that bind any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, as well as ribozymes or antisense or siRNAs which target the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 nucleic acid sequence or fragments or variants thereof which are useful for treatment or prevention of cancer, autoimmune disorders, transplant rejection, GVHD, and/or for blocking or enhancing immune costimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptide.

It is another object of the invention to provide and use compounds including drugs such as small molecules, peptides, antibodies and fragments that bind the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, as well as ribozymes or antisense or siRNAs which target the FXYD3 nucleic acid sequence or fragments or variants thereof which are useful for treatment or prevention of cancer.

It is a preferred object to provide therapeutic and diagnostic antibodies and fragments and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino-acids residues 23-234 of the sequence AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or residues 23-270 of the sequence AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or residues 23-296 of the sequence AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or residues 23-193 of the sequence AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or residues 23-203 of the sequence AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or residues 23-231 of the sequence AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of the sequence AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302 of the VSIG1 protein sequences disclosed herein.

It is a preferred embodiment to provide therapeutic and diagnostic antibodies and fragments and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino-acids residues 24-162 of the ILDR1 protein sequence contained in the sequence of AA424839_P3 (SEQ ID NO:22) and AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, residues 24-457 of the ILDR1 protein sequence contained in the sequence of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, and residues 24-105 of the ILDR1 protein sequence contained in the sequence of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301.

It is a preferred object to provide therapeutic and diagnostic antibodies and fragments and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino-acids residues 38-349 of the LOC253012 protein sequence contained in the sequence of H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of the LOC253012 protein sequence contained in the sequences of H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequence H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300.

It is a preferred object to provide therapeutic and diagnostic antibodies, antibody fragments, and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino acid residues 29-147 of the AI216611 protein sequence contained in the sequence of AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298.

It is a preferred object to provide therapeutic and diagnostic antibodies, antibody fragments, and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino acid residues 21-186 of the C1ORF32 protein sequence contained in the sequence of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of the C1ORF32 protein sequence contained in the sequence of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94).

It is a preferred object to provide therapeutic and diagnostic antibodies, antibody fragments, and conjugates containing same, useful in treating or diagnosing any of the foregoing, that specifically bind to amino acid residues 21-36 of the FXYD3 protein sequence contained in the sequence of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of the FXYD3 protein sequence contained in the sequence of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of the FXYD3 protein sequence contained in the sequence of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297.
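By way of illustration only, the residue ranges recited in the foregoing paragraphs can be read as sub-sequences of the parent variant sequence. The following minimal sketch assumes that the ranges are 1-based and inclusive, which is the usual convention for protein residue numbering but is not stated expressly in this section. The function name `extract_residues` and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
def extract_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a protein sequence.

    Assumption: start and end are 1-based and inclusive, so that
    "residues 23-234" denotes 234 - 23 + 1 = 212 residues.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(
            f"range {start}-{end} is outside a sequence of length {len(sequence)}"
        )
    # Python slices are 0-based and end-exclusive, hence the start - 1.
    return sequence[start - 1:end]


# Hypothetical placeholder sequence, NOT a sequence from the listing.
toy_sequence = "ABCDEFGHIJ"
fragment = extract_residues(toy_sequence, 3, 7)
print(fragment, len(fragment))
```

Under this reading, a fragment such as residues 23-234 of a variant would be obtained by calling the function with the variant's full-length sequence and the bounds 23 and 234.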

It is also a preferred object to provide antibodies and fragments thereof that bind to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 and the specific residues above-identified and fragments thereof, wherein the antibody is a chimeric, humanized, or fully human antibody and/or is an antibody or antibody fragment having CDC or ADCC activities on target cells.

It is also a preferred object to provide chimeric and human antibodies and fragments thereof, and conjugates containing same, that bind to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 and the specific residues above-identified and fragments thereof.

It is another specific object of the invention to provide antibody fragments, and conjugates containing same, useful in the foregoing therapies and related diagnostic methods, including but not limited to Fab, F(ab′)2, Fv or scFv fragments.

It is also an object of the invention to directly or indirectly attach the subject antibodies and fragments to a detectable marker or to an effector moiety such as an enzyme, a toxin, a therapeutic agent, or a chemotherapeutic agent.

In a preferred embodiment the inventive antibodies or fragments may be attached directly or indirectly to a radioisotope, a metal chelator, an enzyme, a fluorescent compound, a bioluminescent compound or a chemiluminescent compound.

It is also an object of the invention to provide pharmaceutical and diagnostic compositions that comprise a therapeutically or diagnostically effective form of an antibody or antibody fragment according to the invention.

It is another specific object of the invention to inhibit the growth of cells that express VSIG1 in a subject, comprising: administering to said subject an antibody that specifically binds to the antigen referred to herein as AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16) or VSIG1.

It is another specific object of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16) or VSIG1.

It is a more preferred object of the invention to use these antibodies for treating cancers selected from the group consisting of lung cancer and ovarian cancer, wherein the lung cancer or the ovarian cancer is non-metastatic, invasive or metastatic, and wherein preferably the antibody has an antigen-binding region specific for the extracellular domain of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15) or AI581519_P10 (SEQ ID NO:16).

It is another object of the invention to provide methods for treating or preventing autoimmune diseases, comprising administering to a patient an effective amount of a polyclonal or monoclonal antibody, or a fragment thereof, or a conjugate containing same, that specifically binds AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15) or AI581519_P10 (SEQ ID NO:16).

It is another specific embodiment of the invention to inhibit the growth of cells that express ILDR1 in a subject, comprising: administering to said subject an antibody that specifically binds to the antigen referred to herein as AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24) or ILDR1.

It is another specific embodiment of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds to AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24) or ILDR1.

It is a more preferred embodiment of the invention to use these antibodies for treating cancers selected from the group consisting of colon cancer or ovarian cancer, and wherein the colon cancer or the ovarian cancer is non-metastatic, invasive or metastatic wherein preferably the antibody has an antigen-binding region specific for the extracellular domain of AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23) or AA424839_1_P11 (SEQ ID NO:24).

It is another embodiment of the invention to provide methods for treating or preventing autoimmune diseases, comprising administering to a patient an effective amount of a polyclonal or monoclonal antibody or fragment that specifically binds AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24).

It is another specific object of the invention to inhibit the growth of cells that express LOC253012 in a subject, comprising: administering to said subject an antibody that specifically binds to the antigen referred to herein as H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40) or LOC253012.

It is another specific object of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40) or LOC253012.

It is a more preferred object of the invention to use these antibodies for treating cancers selected from the group consisting of lung cancer, especially small cell lung carcinoma, and wherein the lung cancer is non-metastatic, invasive or metastatic wherein preferably the antibody has an antigen-binding region specific for the extracellular domain of H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40).

It is another object of the invention to provide methods for treating or preventing autoimmune diseases, comprising administering to a patient an effective amount of a polyclonal or monoclonal antibody or fragment that specifically binds H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39) or H68654_1_P14 (SEQ ID NO:40).

It is another specific object of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds to AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44).

It is another object of the invention to provide methods for treating or preventing autoimmune diseases, comprising administering to a patient an effective amount of a polyclonal or monoclonal antibody, or a fragment thereof, or a conjugate containing same, that specifically binds AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44).

It is another specific object of the invention to inhibit the growth of cells that express C1ORF32 in a subject, comprising: administering to said subject an antibody that specifically binds to the antigen referred to herein as H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), or C1ORF32.

It is another specific object of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds to H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50) or C1ORF32.

It is a more preferred object of the invention to use these antibodies for treating cancers selected from the group consisting of lung cancer, particularly lung small cell carcinoma, and wherein the lung cancer is non-metastatic, invasive or metastatic, wherein preferably the antibody has an antigen-binding region specific for the extracellular domain of H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50).

It is another object of the invention to provide methods for treating or preventing autoimmune diseases, comprising administering to a patient an effective amount of a polyclonal or monoclonal antibody or fragment that specifically binds H19011_1_P8 (SEQ ID NO:48) or H19011_1_P9 (SEQ ID NO:50).

It is another specific object of the invention to inhibit the growth of cells that express FXYD3 in a subject, comprising: administering to said subject an antibody that specifically binds to the antigen referred to herein as R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74).

It is another specific object of the invention to use part or all of the ectodomain of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, or variants thereof, and conjugates containing same, for administration as an anti-cancer vaccine for immunotherapy of cancer, including but not limited to ovarian cancer.

It is another specific object of the invention to provide methods for treating or preventing cancer, comprising administering to a patient an effective amount of a monoclonal antibody that specifically binds to R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74).

It is a more preferred object of the invention to use these antibodies for treating ovarian cancer, and wherein the ovarian cancer is non-metastatic, invasive or metastatic, wherein preferably the antibody has an antigen-binding region specific for the extracellular domain of R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74).

In another embodiment of the invention the cancer is selected from the group consisting of non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the lung, ovary, breast, prostate, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic.

In a preferred embodiment the autoimmune diseases include Multiple sclerosis; Psoriasis; Rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteritis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

It is a specific object of the invention to provide methods for treating or preventing rejection of any organ transplant and/or graft versus host disease, comprising administering to a patient an effective amount of an antibody that specifically binds AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74). It is also preferred in the foregoing methods that the antibody possess an antigen-binding region specific for the extracellular domain of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74).

According to the present invention, each one of the following: the VSIG1 ectodomain, ILDR1 ectodomain, LOC253012 ectodomain, AI216611 ectodomain, C1ORF32 ectodomain or FXYD3 ectodomain of the present invention, antibodies and fragments that bind the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, the compounds including drugs such as small molecules, peptides, as well as ribozymes or antisense or siRNAs which target the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 nucleic acid sequence or fragments or variants thereof which are useful for treatment or prevention of cancer, autoimmune disorders, transplant rejection, GVHD, and/or for blocking or enhancing immune co-stimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptide, can be used with simultaneous blockade of several co-stimulatory pathways or in combination therapy with conventional drugs, such as immunosuppressants or cytotoxic drugs for cancer.

It is another object of the invention to provide assays for detecting the presence of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74) protein in vitro or in vivo in a biological sample or individual comprising contacting the sample with an antibody having specificity for AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74) polypeptides, or a combination thereof, and detecting the binding of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), 
H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74) protein in the sample.

It is another object of the invention to provide methods for detecting a disease, diagnosing a disease, monitoring disease progression or treatment efficacy or relapse of a disease, or selecting a therapy for a disease, comprising detecting expression of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74).

In a related object the detected diseases will include cancers such as lung cancer, ovarian cancer, colon cancer, as well as other non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer is non-metastatic, invasive or metastatic.

With regard to lung cancer, the disease is selected from the group consisting of non-metastatic, invasive or metastatic lung cancer; squamous cell lung carcinoma, lung adenocarcinoma, carcinoid, small cell lung cancer or non-small cell lung cancer; detection of overexpression in lung metastasis (vs. primary tumor); detection of overexpression in lung cancer, for example non-small cell lung cancer, for example adenocarcinoma, squamous cell cancer or carcinoid, or large cell carcinoma; identification of a metastasis of unknown origin which originated from a primary lung cancer; assessment of a malignant tissue residing in the lung that is from a non-lung origin, including but not limited to: osteogenic and soft tissue sarcomas; colorectal, uterine, cervix and corpus tumors; head and neck, breast, testis and salivary gland cancers; melanoma; and bladder and kidney tumors; distinguishing between different types of lung cancer, therefore potentially affecting treatment choice (e.g. small cell vs. non-small cell tumors); analysis of unexplained dyspnea and/or chronic cough and/or hemoptysis; differential diagnosis of the origin of a pleural effusion; diagnosis of conditions which have similar symptoms, signs and complications as lung cancer and where the differential diagnosis between them and lung cancer is of clinical importance including but not limited to: non-malignant causes of lung symptoms and signs, including but not limited to: lung lesions and infiltrates, wheeze, stridor, tracheal obstruction, esophageal compression, dysphagia, recurrent laryngeal nerve paralysis, hoarseness, phrenic nerve paralysis with elevation of the hemidiaphragm and Horner syndrome; or detecting a cause of any condition suggestive of a malignant tumor including but not limited to anorexia, cachexia, weight loss, fever, hypercalcemia, hypophosphatemia, hyponatremia, syndrome of inappropriate secretion of antidiuretic hormone, elevated ANP, elevated ACTH, hypokalemia, clubbing, neurologic-myopathic syndromes and thrombophlebitis.

With regard to ovarian cancer, the compounds of the present invention can be used in the diagnosis, treatment or prognostic assessment of non-metastatic, invasive or metastatic ovarian cancer; correlating stage and malignant potential; identification of a metastasis of unknown origin which originated from a primary ovarian cancer; differential diagnosis between benign and malignant ovarian cysts; diagnosing a cause of infertility, for example differential diagnosis of various causes thereof; detection of one or more non-ovarian cancer conditions that may elevate serum levels of ovary related markers, including but not limited to: cancers of the endometrium, cervix, fallopian tubes, pancreas, breast, lung and colon; nonmalignant conditions such as pregnancy, endometriosis, pelvic inflammatory disease and uterine fibroids; diagnosing conditions which have similar symptoms, signs and complications as ovarian cancer and where the differential diagnosis between them and ovarian cancer is of clinical importance including but not limited to: non-malignant causes of pelvic mass, including, but not limited to: benign (functional) ovarian cyst, uterine fibroids, endometriosis, benign ovarian neoplasms and inflammatory bowel lesions; determining a cause of any condition suggestive of a malignant tumor including but not limited to anorexia, cachexia, weight loss, fever, hypercalcemia, skeletal or abdominal pain, paraneoplastic syndrome, or ascites.

In another related object the detected diseases will include autoimmune and neoplastic disorders selected from the group consisting of Multiple sclerosis; Psoriasis; Rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteritis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another related object the detected diseases will include rejection of any organ transplant and/or Graft versus host disease.

In a related aspect the foregoing assays will detect cells affected by the disease using the antibody that binds specifically to the AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74) protein wherein the assays may be effected in vitro or in vivo, and include RIA, ELISA, fluorimetric assays, FACS, slot blot, Western blot, immunohistochemical assays, radioimaging assays and the like. In some embodiments, this invention provides a method for diagnosing a disease in a subject, comprising detecting in the subject or in a sample obtained from said subject at least one polypeptide or polynucleotide selected from the group consisting of:

a polypeptide comprising an amino acid sequence as set forth in any one of SEQ ID NOs: 11-16, 21-34, 35-40, 43-44, 48-50, 70-76, 138-151, 296, 298-302;

a polypeptide comprising a bridge, edge portion, tail or head portion, of any one of SEQ ID NOs: 284-295, or a homologue or a fragment thereof;

a polynucleotide comprising a nucleic acid sequence as set forth in any one of SEQ ID NOs: 1-10, 17-20, 25-34, 41-42, 45-46, 51-69;

a polynucleotide comprising a nucleic acid sequence encoding a polypeptide comprising a bridge, edge portion, tail or head portion, of any one of SEQ ID NOs: 284-295;

an oligonucleotide having a nucleic acid sequence as set forth in SEQ ID NOs: 187, 190, 193, 196, 199, 202, 205, 208, 211, 214, 217, 220, 223, 226, 229, 232, 235, 238, 241, 244, 247, 250, 253.

According to a further embodiment, detecting a polypeptide of the invention comprises employing an antibody capable of specifically binding to at least one epitope of a polypeptide comprising an amino acid sequence of a bridge, edge portion, tail, or head portion of any one of SEQ ID NOs: 284-295. According to one embodiment, detecting the presence of the polypeptide or polynucleotide is indicative of the presence of the disease and/or its severity and/or its progress. According to another embodiment, a change in the expression and/or the level of the polynucleotide or polypeptide compared to its expression and/or level in a healthy subject or a sample obtained therefrom is indicative of the presence of the disease and/or its severity and/or its progress. According to a further embodiment, a change in the expression and/or level of the polynucleotide or polypeptide compared to its level and/or expression in said subject or in a sample obtained therefrom at an earlier stage is indicative of the progress of the disease. According to a still further embodiment, detecting the presence and/or relative change in the expression and/or level of the polynucleotide or polypeptide is useful for selecting a treatment and/or monitoring a treatment of the disease.
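As a purely illustrative sketch, the comparison of a measured level against a reference level (a healthy subject, or an earlier sample from the same subject) may be expressed as a fold change. The function names and the two-fold threshold below are hypothetical choices made for the example only; this section does not specify any particular threshold or statistic.

```python
def fold_change(sample_level: float, reference_level: float) -> float:
    """Ratio of the measured level to the reference level."""
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return sample_level / reference_level


def is_changed(sample_level: float, reference_level: float,
               threshold: float = 2.0) -> bool:
    """Flag an increase or a decrease of at least `threshold`-fold.

    The default threshold of 2.0 is an assumption for illustration,
    not a value taken from the specification.
    """
    ratio = fold_change(sample_level, reference_level)
    return ratio >= threshold or ratio <= 1.0 / threshold


# Hypothetical measurements in arbitrary units.
print(fold_change(8.0, 2.0), is_changed(8.0, 2.0))
```

The same comparison applies whether the reference is a healthy subject or an earlier time point; only the choice of `reference_level` differs.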

According to one embodiment, detecting a polynucleotide of the invention comprises employing a primer pair, comprising a pair of isolated oligonucleotides capable of specifically hybridizing to at least a portion of a polynucleotide having a nucleic acid sequence as set forth in SEQ ID NOs: 187, 190, 193, 196, 199, 202, 205, 208, 211, 214, 217, 220, 223, 226, 229, 232, 235, 238, 241, 244, 247, 250, 253, or polynucleotides homologous thereto.

According to another embodiment, detecting a polynucleotide of the invention comprises employing a primer pair, comprising a pair of isolated oligonucleotides as set forth in SEQ ID NOs: 185-186, 188-189, 191-192, 194-195, 197-198, 200-201, 203-204, 206-207, 209-210, 212-213, 215-216, 218-219, 221-222, 224-225, 227-228, 230-231, 233-234, 236-237, 239-240, 242-243, 245-246, 248-249, 251-252.
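The two SEQ ID NO lists above follow a regular numbering pattern: 23 target sequences at every third number from 187 to 253, and 23 primer pairs occupying the two numbers immediately preceding each target. The sketch below reproduces both lists from that pattern. The pairing of each primer pair with the target that follows it is an inference from the numbering alone and is an assumption of this example, not a statement made in this section.

```python
# Target sequences recited above: SEQ ID NOs 187, 190, ..., 253.
target_ids = list(range(187, 254, 3))

# Assumed pairing: the primer pair for target n is (n - 2, n - 1),
# e.g. primers 185-186 for target 187.
primer_pairs = {target: (target - 2, target - 1) for target in target_ids}

for target, (first_primer, second_primer) in primer_pairs.items():
    print(f"SEQ ID NO:{target} <- primers SEQ ID NOs:{first_primer}-{second_primer}")
```

Generating the lists this way gives a quick consistency check on the recited numbers: every primer pair has exactly one target and no SEQ ID NO is used twice.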

The invention also includes the following specific embodiments.

In one embodiment the invention includes an isolated polypeptide selected from AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74) or a fragment or variant thereof that possesses at least 95, 96, 97, 98 or 99% sequence identity therewith.
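For illustration, a percent sequence identity of the kind recited above can be computed from two sequences that have already been aligned. The sketch below is a simplified calculation under stated assumptions: the inputs are pre-aligned strings of equal length with `-` marking gaps, and identity is the number of matching columns divided by the number of aligned columns. This section does not specify the alignment method or the denominator, so both are assumptions of the example, and the function names and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumptions: equal-length inputs, '-' denotes a gap, and the
    denominator is the number of columns that are not gap-only.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity(aligned_a: str, aligned_b: str, minimum: float = 95.0) -> bool:
    """True if the two sequences share at least `minimum` percent identity."""
    return percent_identity(aligned_a, aligned_b) >= minimum


# Hypothetical 20-residue toy sequences differing at one position.
reference = "ACDEFGHIKLMNPQRSTVWY"
variant = "ACDEFGHIKLMNPQRSTVWA"
print(percent_identity(reference, variant), meets_identity(reference, variant))
```

In practice the alignment itself would be produced by a dedicated alignment tool; the sketch covers only the final counting step.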

In another embodiment the invention includes a fragment or conjugate comprising any one of the foregoing polypeptides.

In another embodiment the invention includes any one of the foregoing polypeptides fused to an immunoglobulin domain.

In another embodiment the invention includes any of the foregoing polypeptides attached to a detectable or therapeutic moiety.

In another embodiment the invention includes a nucleic acid sequence encoding any of the foregoing polypeptides.

In another embodiment the invention includes any of the nucleic acid sequences selected from AI581519_T10 (SEQ ID NO:9), AI581519_T11 (SEQ ID NO:10), AA424839_1_T7 (SEQ ID NO:20), H68654_1_T8 (SEQ ID NO:28), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), H68654_1_T20 (SEQ ID NO:34), AI216611_T1 (SEQ ID NO:42), H19011_1_T8 (SEQ ID NO:45), H19011_1_T9 (SEQ ID NO:46), R31375_T19 (SEQ ID NO:65), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T39 (SEQ ID NO:69), or a fragment or variant thereof that possesses at least 95, 96, 97, 98 or 99% sequence identity therewith, and conjugates containing same.

In another embodiment the invention includes an isolated VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ectodomain polypeptide, or fragment or conjugate thereof.

In another embodiment the invention includes any of the foregoing polypeptides, comprising a sequence of amino acid residues having at least 95, 96, 97, 98 or 99% sequence identity with amino acid residues 23-234 of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or amino acid residues 23-270 of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or amino acid residues 23-296 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or amino acid residues 23-193 of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or amino acid residues 23-203 of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or amino acid residues 23-231 of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302, or amino acid residues 24-162 of AA424839_P3 (SEQ ID NO:22), or AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, or amino acid residues 24-456 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, or amino acid residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301, or amino acid residues 38-349 of H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), or H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequences H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, or amino acid residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, or amino acid residues 21-186 of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94), or amino acid residues 21-36 of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297.
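By way of illustration only, the residue ranges and the percent-identity thresholds recited above can be sketched in Python. The sketch assumes 1-based, inclusive residue numbering and assumes that the two sequences have already been aligned by an external tool, with "-" marking gaps. The function names and the convention of dividing matches by the number of alignment columns are assumptions made for this example; other identity conventions exist, and the specification does not prescribe one here.

```python
def extract_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive numbering
    (the numbering assumed for ranges such as "residues 23-234")."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    return sequence[start - 1:end]


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumed convention: identical columns divided by all alignment
    columns, skipping columns in which both sequences carry a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue  # column carries no residue in either sequence
        columns += 1
        if a == b:
            matches += 1  # a gap paired with a residue never matches
    if columns == 0:
        raise ValueError("alignment contains no residues")
    return 100.0 * matches / columns


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy sequences for illustration only; not sequences of the invention.
    reference = "ACDEFGHIKLMNPQRSTVWY"
    candidate = "ACDEFGHIKLMNPQRSTVWA"
    print(extract_residues(reference, 2, 4))
    print(percent_identity(reference, candidate))
    print(meets_identity_threshold(reference, candidate, 95.0))
```

In practice the alignment itself would be produced by an established alignment program, and the threshold (95, 96, 97, 98 or 99%) would be chosen according to the embodiment in question.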

In another embodiment the invention includes any of the foregoing polypeptides, comprising the extracellular domain of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74).

In another embodiment the invention includes any of the foregoing polypeptides, attached to a detectable or therapeutic moiety.

In another embodiment the invention includes any of the foregoing nucleic acid sequences encoding any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ectodomain polypeptides, and conjugates containing same.

In another embodiment the invention includes an expression vector containing any of the foregoing nucleic acid sequences.

In another embodiment the invention includes a host cell comprising the foregoing expression vector or a virus containing a nucleic acid sequence encoding the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 ectodomain polypeptide, or fragment or conjugate thereof, wherein the cell expresses the polypeptide encoded by the DNA segment.

In another embodiment the invention includes a method of producing any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ectodomain polypeptides, or fragment or conjugate thereof, comprising culturing the foregoing host cell, wherein the cell expresses the polypeptide encoded by the DNA segment or nucleic acid, and recovering said polypeptide.

In another embodiment the invention includes any of the foregoing isolated soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 ectodomain wherein said polypeptide blocks or inhibits the interaction of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof with a corresponding functional counterpart.

In another embodiment the invention includes the foregoing isolated soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 ectodomains, wherein said polypeptide replaces or augments the interaction of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74), or a fragment or variant or conjugate thereof with a corresponding functional counterpart.

In another embodiment the invention includes a fusion protein comprising any of the foregoing isolated soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ectodomains joined to a non-VSIG1, non-ILDR1, non-LOC253012, non-AI216611, non-C1ORF32 or non-FXYD3 protein sequence, respectively.

In another embodiment the invention includes any of the foregoing fusion proteins, wherein the non-VSIG1, non-ILDR1, non-LOC253012, non-AI216611, non-C1ORF32, non-FXYD3 protein is at least a portion of an immunoglobulin molecule.

In another embodiment the invention includes any of the foregoing fusion proteins, wherein a polyalkyl oxide moiety such as polyethylene glycol is attached to the polypeptide.

In another embodiment the invention includes any of the foregoing fusion proteins, wherein the immunoglobulin heavy chain constant region is an Fc fragment.

In another embodiment the invention includes any one of the protein sequences of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECDs fused to mouse Fc, as set forth in any one of the amino acid sequences depicted in SEQ ID NOs:103-108. The invention further includes the nucleic acid sequences encoding the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECDs fused to mouse Fc, as set forth in any one of the nucleic acid sequences depicted in SEQ ID NOs:97-102.

In another embodiment the invention includes any of the foregoing fusion proteins wherein the immunoglobulin heavy chain constant region is an isotype selected from the group consisting of an IgG1, IgG2, IgG3, IgG4, IgM, IgE, IgA and IgD.

In another embodiment the invention includes any of the foregoing fusion proteins, wherein the polypeptide is fused to a VASP domain.

In another embodiment the invention includes any of the foregoing fusion proteins, wherein the fusion protein modulates lymphocyte activation.

In another embodiment the invention includes a pharmaceutical composition comprising any of the foregoing polynucleotide sequences and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a pharmaceutical composition comprising the foregoing vector and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a pharmaceutical composition comprising the foregoing host cell and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a pharmaceutical composition comprising any of the foregoing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ectodomains and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a pharmaceutical composition comprising any of the foregoing polypeptides and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a pharmaceutical composition comprising the foregoing fusion protein and further comprising a pharmaceutically acceptable diluent or carrier.

In another embodiment the invention includes a method for treating or preventing cancer, comprising administering to a subject in need thereof a pharmaceutical composition comprising: a soluble molecule having the extracellular domain of a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptide, or fragment or conjugate thereof; or a polypeptide comprising a sequence of amino acid residues having at least 95, 96, 97, 98 or 99% sequence identity with amino acid residues 23-234 of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or amino acid residues 23-270 of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or amino acid residues 23-296 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or amino acid residues 23-193 of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or amino acid residues 23-203 of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or amino acid residues 23-231 of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302, or amino acid residues 24-162 of AA424839_P3 (SEQ ID NO:22), or AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, or amino acid residues 24-456 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, or amino acid residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301, or amino acid residues 38-349 of H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), or H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequences H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, or amino acid residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, or amino acid residues 21-186 of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94), or amino acid residues 21-36 of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297, or a nucleic acid sequence encoding the same.

In another embodiment the invention includes the foregoing method, wherein the cancer is selected from the group consisting of hematological malignancies such as acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, non-Hodgkin's lymphoma, and soft tissue or solid tumors such as cancer of breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas and brain, and wherein the cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method, wherein the cancer is selected from the group consisting of lung cancer, ovarian cancer and colon cancer, and wherein the lung cancer, the ovarian cancer or the colon cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes a method for treating or preventing immune related conditions, such as autoimmune diseases or transplant rejection, comprising administering to a subject in need thereof a pharmaceutical composition comprising: a soluble molecule having the extracellular domain of a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptide, or fragment or conjugate thereof; or a polypeptide comprising a sequence of amino acid residues having at least 95, 96, 97, 98 or 99% sequence identity with amino acid residues 23-234 of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or amino acid residues 23-270 of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or amino acid residues 23-296 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or amino acid residues 23-193 of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or amino acid residues 23-203 of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or amino acid residues 23-231 of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302, or amino acid residues 24-162 of AA424839_P3 (SEQ ID NO:22), or AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, or amino acid residues 24-456 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, or amino acid residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301, or amino acid residues 38-349 of H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), or H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequences H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, or amino acid residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, or amino acid residues 21-186 of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94), or residues 21-36 of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297, or a nucleic acid sequence encoding the same.

In another embodiment the invention includes the foregoing method, wherein the autoimmune diseases are selected from the group consisting of multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection; benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another embodiment the invention includes the foregoing method, wherein the immune related disorders are selected from transplant rejection or graft versus host disease.

In another embodiment the invention includes an siRNA, antisense RNA, or ribozyme that binds the transcript encoding any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 polypeptides, selected from AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or a variant thereof, and inhibits its expression.

In another embodiment the invention includes a polyclonal or monoclonal antibody that specifically binds and/or modulates an activity elicited by any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 polypeptides, selected from AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or a variant thereof, and conjugates containing same.

In another embodiment the invention includes a monoclonal or polyclonal antibody or an antigen binding fragment thereof comprising an antigen binding site that binds specifically to any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 polypeptides comprised in AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or fragment or variant thereof that is at least 80% identical thereto.

In another embodiment the invention includes any of the foregoing antibodies or fragments thereof, wherein said antibody blocks or inhibits the interaction of one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof with a counterpart activity or function.

In another embodiment the invention includes any of the foregoing antibodies or fragments wherein said antibody replaces or augments the interaction of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof with a counterpart function or activity.

In another embodiment the invention includes a method for modulating lymphocyte activity, comprising contacting an AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74) positive lymphocyte with a bioactive agent capable of modulating VSIG1-mediated, ILDR1-mediated, LOC253012-mediated, AI216611-mediated, C1ORF32-mediated, or FXYD3-mediated signaling in an amount effective to modulate at least one lymphocyte activity.

In another embodiment the invention includes the foregoing method, wherein said agent comprises an antagonist of VSIG1-mediated, ILDR1-mediated, LOC253012-mediated, AI216611-mediated, C1ORF32-mediated, or FXYD3-mediated signaling, and wherein said contacting inhibits the attenuation of lymphocyte activity mediated by such signaling.

In another embodiment the invention includes the foregoing method, wherein said contacting increases lymphocyte activity.

In another embodiment the invention includes the foregoing method, wherein said antagonist comprises a blocking agent capable of interfering with the functional interaction of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen and its counterpart.

In another embodiment the invention includes the foregoing antibody or fragment which is suitable for treatment or prevention of cancer by modulating the activity of any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins in a B7-like co-stimulatory system.

In another embodiment the invention includes the foregoing method wherein the administered antibody or fragment inhibits negative stimulation of T cell activity against cancer cells.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the cancer is selected from the group consisting of hematological malignancies such as acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, non-Hodgkin's lymphoma, and soft tissue or solid tumors such as cancer of breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas and brain, and wherein the cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes any of the foregoing antibodies or fragments, which are suitable for treatment or prevention of immune related disorders, such as autoimmune diseases or transplant rejection, by modulating the activity of any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 proteins in a B7-like co-stimulatory system.

In another embodiment the invention includes any of the foregoing antibodies or fragments, which are suitable for treating an autoimmune disease selected from multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; ulcerative colitis; Crohn's disease, immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another embodiment the invention includes any of the foregoing antibodies or fragments, suitable for treating transplant rejection or graft versus host disease.

In another embodiment the invention includes any of the foregoing antibodies or fragments that specifically bind to amino acid residues 23-234 of AI581519_P3 (SEQ ID NO:11), corresponding to amino acid sequence depicted in SEQ ID NO:138, or amino acid residues 23-270 of AI581519_P4 (SEQ ID NO:12), corresponding to amino acid sequence depicted in SEQ ID NO:139, or amino acid residues 23-296 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:140, or amino acid residues 23-193 of AI581519_P7 (SEQ ID NO:14), corresponding to amino acid sequence depicted in SEQ ID NO:141, or amino acid residues 23-203 of AI581519_P9 (SEQ ID NO:15), corresponding to amino acid sequence depicted in SEQ ID NO:142, or amino acid residues 23-231 of AI581519_P10 (SEQ ID NO:16), corresponding to amino acid sequence depicted in SEQ ID NO:143, or residues 26-293 of AI581519_P5 (SEQ ID NO:13), corresponding to amino acid sequence depicted in SEQ ID NO:302, or amino acid residues 24-162 of AA424839_P3 (SEQ ID NO:22), or AA424839_P5 (SEQ ID NO:21), corresponding to amino acid sequence depicted in SEQ ID NO:75, or amino acid residues 24-456 of AA424839_P7 (SEQ ID NO:23), corresponding to amino acid sequence depicted in SEQ ID NO:76, or amino acid residues 24-105 of AA424839_1_P11 (SEQ ID NO:24), corresponding to amino acid sequence depicted in SEQ ID NO:296, or residues 50-160 of AA424839_1_P3 (SEQ ID NO:22), corresponding to amino acid sequence depicted in SEQ ID NO:301, or amino acid residues 38-349 of H68654_1_P2 (SEQ ID NO:35), corresponding to amino acid sequence depicted in SEQ ID NO:144, or residues 19-337 of H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), or H68654_1_P14 (SEQ ID NO:40), corresponding to amino acid sequence depicted in SEQ ID NO:145, or residues 1-335 of the sequences H68654_1_P5 (SEQ ID NO:36), corresponding to amino acid sequence depicted in SEQ ID NO:300, or amino acid residues 29-147 of the sequence AI216611_P0 (SEQ ID NO:43) or AI216611_P1 (SEQ ID NO:44), corresponding to amino acid sequence depicted in SEQ ID NO:146, or residues 1-145 of the sequence AI216611_P0 (SEQ ID NO:43), corresponding to amino acid sequence depicted in SEQ ID NO:298, or amino acid residues 21-186 of H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:147, or residues 21-169 of H19011_1_P9 (SEQ ID NO:50), corresponding to amino acid sequence depicted in SEQ ID NO:148, or residues 1-184 of the sequence H19011_1_P8 (SEQ ID NO:48), corresponding to amino acid sequence depicted in SEQ ID NO:299 (with non-silent SNPs as given in Table 94), or amino acid residues 21-36 of R31375_P0 (SEQ ID NO:70) or R31375_P31 (SEQ ID NO:73), corresponding to amino acid sequence depicted in SEQ ID NO:149, or residues 21-65 of R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:150, or residues 21-25 of R31375_P33 (SEQ ID NO:74), corresponding to amino acid sequence depicted in SEQ ID NO:151, or residues 1-63 of the sequence R31375_P14 (SEQ ID NO:72), corresponding to amino acid sequence depicted in SEQ ID NO:297, or a variant or fragment or an epitope thereof.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the antigen binding site contains from about 3 to about 7 contiguous or non-contiguous amino acids, more typically at least 5 contiguous or non-contiguous amino acids. These binding sites include conformational and non-conformational epitopes.
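As a purely illustrative sketch, the contiguous case described above (stretches of about 3 to 7 amino acids) can be enumerated over a sequence with a sliding window. The function name, the toy sequence and the 1-based coordinates are assumptions made for this example. The sketch covers only contiguous (linear) stretches; non-contiguous or conformational epitopes depend on three-dimensional structure and are not modeled here.

```python
from typing import Iterator, Tuple


def contiguous_windows(sequence: str, min_len: int = 3,
                       max_len: int = 7) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end, peptide) for every contiguous stretch of
    min_len..max_len residues, using 1-based inclusive coordinates."""
    if min_len < 1 or min_len > max_len:
        raise ValueError("require 1 <= min_len <= max_len")
    for length in range(min_len, max_len + 1):
        # The range is empty when the window is longer than the sequence.
        for start in range(len(sequence) - length + 1):
            yield start + 1, start + length, sequence[start:start + length]


if __name__ == "__main__":
    # Toy sequence for illustration only; not a sequence of the invention.
    toy_ectodomain = "ACDEFGHIKL"
    for start, end, peptide in contiguous_windows(toy_ectodomain):
        print(start, end, peptide)
```

Such an enumeration only lists candidate linear stretches; whether any of them is actually bound by an antibody is an experimental question.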

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the antibody is a fully human antibody.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the antibody is a chimeric antibody.

In another embodiment the invention includes the foregoing antibodies or fragments wherein the antibody is a humanized or primatized antibody.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the fragment is selected from the group consisting of Fab, Fab′, F(ab′)2, F(ab′), F(ab), Fv or scFv fragment and minimal recognition unit.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the antibody or fragment is coupled to a detectable marker, or to an effector moiety.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the effector moiety is an enzyme, a toxin, a therapeutic agent, or a chemotherapeutic agent.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the detectable marker is a radioisotope, a metal chelator, an enzyme, a fluorescent compound, a bioluminescent compound or a chemiluminescent compound.

In another embodiment the invention includes a pharmaceutical composition that comprises any of the foregoing antibodies or a fragment thereof.

In another embodiment the invention includes a pharmaceutical composition that comprises the foregoing antibodies or a fragment thereof.

In another embodiment the invention includes a method of inducing or enhancing an immune response, comprising administering to a patient in need thereof any of the foregoing antibodies or fragments and detecting induction or enhancement of said immune response.

In another embodiment the invention includes a method for potentiating a secondary immune response to an antigen in a patient, which method comprises administering effective amounts of any of the foregoing antibodies or fragments.

In another embodiment the invention includes the foregoing method, wherein the antigen is preferably a cancer antigen, a viral antigen or a bacterial antigen, and the patient has preferably received treatment with an anticancer vaccine or a viral vaccine.

In another embodiment the invention includes a method of treating a patient with a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 positive malignancy, comprising administering to the patient an effective amount of any of the foregoing antibodies or fragments.

In another embodiment the invention includes the foregoing method further comprising co-administering a chemotherapeutic agent.

In another embodiment the invention includes the foregoing method, wherein said malignancy is selected from a group consisting of hematological malignancies such as acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, and soft or solid tumors such as cancer of breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, and brain, and wherein the cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method, wherein said malignancy is selected from the group consisting of lung cancer, ovarian cancer, and colon cancer, and wherein the lung cancer, the ovarian cancer or the colon cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes an assay for detecting the presence of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof in a biological sample comprising contacting the sample with an antibody of any one of the foregoing, and detecting the binding of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof in the sample.

In another embodiment the invention includes a method for detecting a disease, diagnosing a disease, monitoring disease progression or treatment efficacy or relapse of a disease, or selecting a therapy for a disease, comprising detecting expression of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof.

In another embodiment the invention includes the foregoing method wherein detecting expression of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof is performed in vivo or in vitro.

In another embodiment the invention includes the foregoing method, wherein the disease is selected from lung cancer, ovarian cancer, or colon cancer, and wherein the lung cancer, the ovarian cancer or the colon cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method, wherein the disease is multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis or chondrocalcinosis.

In another embodiment the invention includes a method of inhibiting growth of cells that express a polypeptide selected from AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof in a subject, comprising: administering to said subject any of the foregoing antibodies or fragments.

In another embodiment the invention includes a method of treating or preventing cancer comprising the administration of a therapeutically effective amount of an antibody or binding fragment that specifically binds the AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof that possesses at least 80% sequence identity therewith.

In another embodiment the invention includes the foregoing method, wherein the cancer is selected from a group consisting of hematological malignancies such as acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, and soft or solid tumors such as cancer of breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, and brain, and wherein the cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method, wherein the cancer is selected from the group consisting of lung cancer, ovarian cancer, or colon cancer, and wherein the lung cancer, the ovarian cancer or the colon cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method wherein the antibody is a human, humanized or chimeric antibody or antigen binding fragment.

In another embodiment the invention includes the foregoing method wherein the antibody or fragment is attached directly or indirectly to an effector moiety.

In another embodiment the invention includes the foregoing method, wherein the effector is selected from a drug, toxin, radionuclide, fluorophore and an enzyme.

In another embodiment the invention includes a method for treating or preventing an immune disorder, such as autoimmune or transplant related disease, comprising administering to a patient a therapeutically effective amount of an antibody that specifically binds to AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof that possesses at least 80% sequence identity therewith.

In another embodiment the invention includes the foregoing method, wherein the antibody has an antigen-binding region specific for the extracellular domain of any one of said VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 polypeptides.

In another embodiment the invention includes the foregoing method, wherein the antibody or fragment modulates the B7/co-stimulatory system in a manner that inhibits positive stimulation of T cell activity that creates an autoimmune effect.

In another embodiment the invention includes the foregoing method, wherein the treatment is combined with a moiety useful for treating autoimmune or transplant rejection conditions.

In another embodiment the invention includes the foregoing method, wherein the moiety is a cytokine antibody, cytokine receptor antibody, drug, or another immunomodulatory agent.

In another embodiment the invention includes the foregoing method, wherein the autoimmune diseases are selected from a group consisting of multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another embodiment the invention includes the foregoing method wherein the immune disorder is transplant rejection or graft versus host disease.

In another embodiment the invention includes a method of using an antibody or antigen binding fragment that specifically binds AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof for in vivo imaging of tumors or inflammatory sites characterized by the differential expression of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof.

In another embodiment the invention includes the foregoing method which is used in assessing cancer prognosis or a treatment protocol.

In another embodiment the invention includes a method for screening for a disease in a subject, comprising detecting in the subject or in a sample obtained from said subject a polypeptide having a sequence at least 85% homologous to the amino acid sequence as set forth in any one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or with a polypeptide having a sequence comprising the extracellular domain of any one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74).

In another embodiment the invention includes the foregoing method wherein screening for a disease comprises detecting the presence or severity of the disease, disorder or condition, or prognosis of the subject, or treatment selection for said subject, or treatment monitoring of said subject.

In another embodiment the invention includes the foregoing method, wherein the disease is a cancer, selected from the group consisting of lung cancer, ovarian cancer, and colon cancer, and wherein the lung cancer, the ovarian cancer or the colon cancer is non-metastatic, invasive or metastatic.

In another embodiment the invention includes the foregoing method wherein the disease is autoimmune disease and is selected from multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection; benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another embodiment the invention includes the foregoing method, wherein the detection is conducted by immunoassay.

In another embodiment the invention includes the foregoing method, wherein the immunoassay utilizes an antibody which specifically interacts with the polypeptide having a sequence at least 85% homologous to the amino acid sequence as set forth in any one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), or with a polypeptide having a sequence comprising the extracellular domain of any one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), and R31375_P33 (SEQ ID NO:74).

In another embodiment the invention includes an antibody specific to AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), and R31375_P33 (SEQ ID NO:74), or a fragment or variant thereof that elicits apoptosis or lysis of cancer cells that express said protein.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein said apoptosis or lysis activity involves CDC or ADCC activity of the antibody.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the cancer cells are selected from a group consisting of hematological malignancies such as acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, and soft or solid tumors such as cancer of breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, and brain.

In another embodiment the invention includes any of the foregoing antibodies or fragments, wherein the cancer cells are lung, ovarian or colon cancer cells.

In another embodiment the invention relates to any of the foregoing isolated soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 ectodomain polypeptides, wherein said polypeptide or a fragment or variant thereof is used as an anti-cancer vaccine for cancer immunotherapy.

In another embodiment the invention relates to an isolated polypeptide comprising an amino acid sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% homologous to the sequence set forth in any one of SEQ ID NOs: 284-295, or a fragment thereof.
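The percentage thresholds recited above can be illustrated with a minimal sketch of one way to compute percent identity between two amino acid sequences. This is an illustration only and makes several assumptions not taken from the specification: it uses a simple Needleman-Wunsch global alignment with arbitrary scoring values, it counts identical residues over the full alignment length, and the function names and example peptides are hypothetical. The specification's own definition of homology, and the alignment software and parameters it relies on, would govern in practice.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; returns the two gapped strings.

    The scoring values are illustrative assumptions, not values from the text.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned residues as a percentage of the alignment length."""
    gapped_a, gapped_b = global_align(a, b)
    if not gapped_a:
        return 0.0
    matches = sum(1 for x, y in zip(gapped_a, gapped_b) if x == y and x != "-")
    return 100.0 * matches / len(gapped_a)


def meets_threshold(a, b, threshold):
    """True if the two sequences are at least `threshold` percent identical."""
    return percent_identity(a, b) >= threshold


# Hypothetical 10-residue peptides differing at one position.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```

Under this simplified measure, the two hypothetical peptides above would satisfy an 85% threshold but not a 95% threshold.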

In another embodiment the invention relates to an isolated polynucleotide, comprising an amplicon having a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 187, 190, 193, 196, 199, 202, 205, 208, 211, 214, 217, 220, 223, 226, 229, 232, 235, 238, 241, 244, 247, 250, and 253, or polynucleotides homologous thereto.

In another embodiment the invention relates to a primer pair, comprising a pair of isolated oligonucleotides capable of amplifying the above mentioned amplicon.

The primer pair, comprising a pair of isolated oligonucleotides having a sequence selected from the group consisting of SEQ. ID NOs: 185-186, 188-189, 191-192, 194-195, 197-198, 200-201, 203-204, 206-207, 209-210, 212-213, 215-216, 218-219, 221-222, 224-225, 227-228, 230-231, 233-234, 236-237, 239-240, 242-243, 245-246, 248-249, and 251-252.
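The relationship between a primer pair and the amplicon it is capable of amplifying can be sketched computationally. The following is a simplified illustration under stated assumptions: primers are required to match the template exactly, only the first matching site is considered, and the template and primer sequences shown are hypothetical and are not any of the sequences of the sequence listing. The function names are likewise illustrative.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def predict_amplicon(template, forward, reverse):
    """Return the template region spanned by a primer pair, or None.

    The forward primer must occur on the given strand; the reverse primer
    must occur, as its reverse complement, downstream of the forward primer.
    Exact matching only; mismatches and multiple sites are not handled.
    """
    template = template.upper()
    forward = forward.upper()
    start = template.find(forward)
    if start < 0:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(reverse_site)]


# Hypothetical template and primers, for illustration only.
template = "TTACGTACGGATCCAAGGTTCTGACTGAGCATT"
print(predict_amplicon(template, "ACGTACG", "TCAGTCAG"))
```

The returned string begins with the forward primer and ends with the reverse complement of the reverse primer, which is the region a PCR with that pair would be expected to amplify from the template.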

A method for screening for a disease, disorder or condition in a subject, comprising detecting in the subject or in a sample obtained from said subject a polynucleotide having a sequence at least 85% homologous to the nucleic acid sequence as set forth in any one of SEQ ID NOs:1-10, 17-20, 25-34, 41-42, 45-46, 51-69, 187, 190, 193, 196, 199, 202, 205, 208, 211, 214, 217, 220, 223, 226, 229, 232, 235, 238, 241, 244, 247, 250, and 253.

The method as above, wherein screening for a disease comprises detecting the presence or severity of the disease, disorder or condition, or prognosis of the subject, or treatment selection for said subject, or treatment monitoring.

The method as above, wherein the disease is a cancer, selected from the group consisting of lung cancer, colon cancer and ovarian cancer, and wherein the lung cancer, colon cancer or ovarian cancer is non-metastatic, invasive or metastatic.

The method as above, wherein the disease is autoimmune disease.

The method as above, wherein the detection is performed using an oligonucleotide pair capable of hybridizing to at least a portion of a nucleic acid sequence at least 85% homologous to the nucleic acid sequence set forth in SEQ. ID NO: 1-10, 17-20, 25-34, 41-42, 45-46, 51-69, 187, 190, 193, 196, 199, 202, 205, 208, 211, 214, 217, 220, 223, 226, 229, 232, 235, 238, 241, 244, 247, 250, and 253.

The method as above wherein the detection is performed using an oligonucleotide pair as set forth in any one of SEQ. ID NOs: 185-186, 188-189, 191-192, 194-195, 197-198, 200-201, 203-204, 206-207, 209-210, 212-213, 215-216, 218-219, 221-222, 224-225, 227-228, 230-231, 233-234, 236-237, 239-240, 242-243, 245-246, 248-249, and 251-252.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows a schematic summary of quantitative real-time PCR analysis.

FIG. 2 shows a scatter plot, demonstrating the expression of AI581519 transcripts, that encode the VSIG1 proteins, on a virtual panel of all tissues and conditions using MED discovery engine, demonstrating overexpression of AI581519 transcripts in lung cancer compared to normal lung samples.

FIGS. 3A-3E show alignment comparison of the AI581519_P4 (FIG. 3A), AI581519_P5 (FIG. 3B), AI581519_P7 (FIG. 3C), AI581519_P9 (FIG. 3D), and AI581519_P10 (FIG. 3E) proteins to the known VSIG1 proteins NP_872413 (SEQ ID NO: 11) and Q86XK7_HUMAN.

FIG. 4 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7 (SEQ ID NO: 190) in normal and cancerous ovary tissues.

FIG. 5 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7 (SEQ ID NO: 190) in normal and cancerous lung tissues.

FIGS. 6A-6B present histograms showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7 (SEQ ID NO: 190) in various normal tissues. FIG. 6A shows expression of each sample relative to median of the ovary samples; FIG. 6B shows expression of each sample relative to median of the lung samples.
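The normalization described for FIGS. 6A and 6B, in which the expression of each sample is shown relative to the median of a reference tissue's samples, can be sketched as follows. This is a minimal illustration that assumes each sample has already been reduced to a single non-negative expression quantity; the function name and the numeric values are hypothetical and are not data from the figures.

```python
from statistics import median


def relative_to_median(sample_values, reference_values):
    """Divide each sample's expression value by the median of a reference set.

    sample_values: mapping of sample name to expression quantity.
    reference_values: expression quantities of the reference tissue samples
    (for example, normal ovary samples or normal lung samples).
    """
    reference_median = median(reference_values)
    if reference_median == 0:
        raise ValueError("reference median is zero; relative expression is undefined")
    return {name: value / reference_median for name, value in sample_values.items()}


# Hypothetical quantities, for illustration only.
reference = [1.0, 2.0, 4.0]
samples = {"sample_1": 4.0, "sample_2": 1.0}
print(relative_to_median(samples, reference))
```

Choosing a different reference tissue changes only the denominator, which is why the same set of samples can be presented once relative to the ovary median and once relative to the lung median.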

FIG. 7 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7-9 (SEQ ID NO: 187) in normal and cancerous ovary tissues.

FIG. 8 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7-9 (SEQ ID NO: 187) in normal and cancerous lung tissues.

FIG. 9 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519seg7-9 (SEQ ID NO: 196) in blood-specific panel.

FIG. 10 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_junc7-11F2R2 (SEQ ID NO: 193) in normal and cancerous lung tissues.

FIG. 11 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_junc7-11F2R2 (SEQ ID NO: 193) in normal and cancerous ovarian tissues.

FIGS. 12A-12B present histograms showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_junc7-11F2R2 (SEQ ID NO: 193) in various normal tissues. FIG. 12A shows expression of each sample relative to median of the lung samples; FIG. 12B shows expression of each sample relative to median of the ovary samples.

FIG. 13 presents a histogram showing expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_junc7-11F2R2 (SEQ ID NO: 193) in blood-specific panel.

FIG. 14 shows a scatter plot, demonstrating the expression of AA424839 transcripts, that encode the ILDR1 proteins, on a virtual panel of all tissues and conditions using MED discovery engine, demonstrating overexpression of AA424839 transcripts in ovarian cancer compared to normal ovary samples.

FIG. 15 shows a scatter plot, demonstrating the expression of AA424839 transcripts, that encode the ILDR1 proteins, on a virtual panel of all tissues and conditions using MED discovery engine, demonstrating overexpression of AA424839 transcripts in colon cancer compared to normal colon samples.

FIGS. 16A-16C show alignment comparison of the AA424839_P3 (FIG. 16A), AA424839_P7 (FIG. 16B), and AA424839_1_P11 (FIG. 16C) proteins to the known ILDR1 proteins Q86SU0_HUMAN and NP_787120 (SEQ ID NO: 21).

FIG. 17 presents a histogram showing expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg18wt (SEQ ID NO: 199) in normal and cancerous ovary tissues.

FIG. 18 presents a histogram showing expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg18wt (SEQ ID NO: 199) in various normal tissues.

FIG. 19 presents a histogram showing expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg14-16 (SEQ ID NO: 202) in normal and cancerous ovary tissues.

FIG. 20 presents a histogram showing expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg14-16 (SEQ ID NO: 202) in various normal tissues.

FIG. 21 presents a histogram showing expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839seg11-14F3R3 (SEQ ID NO: 205) in blood-specific panel.

FIG. 22 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc4-6F2R2 (SEQ ID NO: 208) in normal and cancerous colon tissues.

FIG. 23 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc4-6F2R2 (SEQ ID NO: 208) in various normal tissues.

FIG. 24 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_seg2WT (SEQ ID NO: 211) in normal and cancerous colon tissues.

FIG. 25 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_seg2WT (SEQ ID NO: 211) in various normal tissues.

FIG. 26 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc4-6 (SEQ ID NO: 214) in blood-specific panel.

FIG. 27 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220) in blood-specific panel.

FIG. 28 presents a histogram showing expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220) in normal and cancerous colon tissues.

FIG. 29 shows a scatter plot, demonstrating the expression of H68654 transcripts, that encode the LOC253012 proteins, on a virtual panel of all tissues and conditions using MED discovery engine, demonstrating overexpression of H68654 transcripts in lung cancer compared to normal lung samples.

FIGS. 30A-30H show alignment comparison of the H68654_1_P7 (FIGS. 30A and 30B), H68654_1_P12 (FIGS. 30C and 30D), H68654_1_P13 (FIGS. 30E and 30F), and H68654_1_P14 (FIGS. 30G and 30H) proteins to the known LOC253012 proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36).

FIG. 31 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg3WTF2R2 (SEQ ID NO: 226) in normal and cancerous Lung tissues.

FIG. 32 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg3WTF2R2 (SEQ ID NO: 226) in various normal tissues.

FIG. 33 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg7-12WT (SEQ ID NO: 223) in normal and cancerous Lung tissues.

FIG. 34 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg7-12WT (SEQ ID NO: 223) in various normal tissues.

FIG. 35A presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654seg3F2R2 (SEQ ID NO: 226) in blood-specific panel.

FIG. 35B presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654seg7-12F1R1 (SEQ ID NO: 223) in blood-specific panel.

FIG. 36 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg0-3 (SEQ ID NO: 229) in normal and cancerous Lung tissues.

FIG. 37 presents a histogram showing expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg2-3 (SEQ ID NO: 232) in normal and cancerous Lung tissues.

FIGS. 38A-38B show alignment comparison of the H19011_1_P8 (FIG. 38A) and H19011_1_P9 (FIG. 38B) proteins to the known C1ORF32 proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47).

FIG. 39 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg13F2R2 (SEQ ID NO: 235) in normal and cancerous Colon tissues.

FIG. 40 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg13F2R2 (SEQ ID NO: 235) in normal and cancerous lung tissues.

FIGS. 41A-41B present a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg13F2R2 (SEQ ID NO: 235) in various normal tissues. FIG. 41A shows expression of each sample relative to median of the colon samples; FIG. 41B shows expression of each sample relative to median of the lung samples.
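The normalisation described for FIGS. 41A and 41B can be illustrated with a short sketch. It assumes that "relative to median" means dividing each sample's expression value by the median of the reference group (colon or lung samples); the function name and all numbers are hypothetical and are not taken from the figures.

```python
from statistics import median


def relative_to_median(sample_values, reference_values):
    """Divide each sample value by the median of the reference group.

    Assumption: 'relative to median' is a simple ratio. The figure
    description does not spell out the formula.
    """
    ref_median = median(reference_values)
    if ref_median == 0:
        raise ValueError("reference median is zero; the ratio is undefined")
    return [value / ref_median for value in sample_values]


# Hypothetical expression values, for illustration only.
normal_tissue_panel = [0.5, 2.0, 8.0]
colon_reference = [1.0, 2.0, 4.0]
lung_reference = [2.0, 4.0, 16.0]

# The same panel, re-expressed against two different reference groups.
relative_to_colon = relative_to_median(normal_tissue_panel, colon_reference)
relative_to_lung = relative_to_median(normal_tissue_panel, lung_reference)
```

Under this assumption the two panels differ only by a constant scale factor, which is why the same samples can be displayed against either reference tissue.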

FIG. 42 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg8-13F1R1 (SEQ ID NO: 238) in normal and cancerous lung tissues.

FIG. 43 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc8-10seg13 (SEQ ID NO: 241) in normal and cancerous lung tissues.

FIG. 44 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc8-10seg13 (SEQ ID NO: 241) in normal and cancerous colon tissues.

FIGS. 45A-45B present histograms showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc8-10seg13 (SEQ ID NO: 241) in various normal tissues. FIG. 45A shows expression of each sample relative to median of the colon samples; FIG. 45B shows expression of each sample relative to median of the lung samples.

FIG. 46 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc8-10seg13 (SEQ ID NO: 241) in blood-specific panel.

FIG. 47 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc6-10F1R1 (SEQ ID NO: 244) in normal and cancerous lung tissues.

FIG. 48 presents a histogram showing expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc6-10F1R1 (SEQ ID NO: 244) in normal and cancerous colon tissues.

FIGS. 49A-49E show alignment comparison of the R31375_P14 (FIG. 49A), R31375_P31 (FIGS. 49B and 49C) and R31375_P33 (FIGS. 49D and 49E), proteins to the known FXYD3 proteins NP_068710 (SEQ ID NO: 71), FXYD3_HUMAN (SEQ ID NO: 70), NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70).

FIG. 50 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc30-33 (SEQ ID NO: 247) in normal and cancerous ovary tissues.

FIG. 51 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc30-33 (SEQ ID NO: 247) in various normal tissues.

FIG. 52 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_seg33junc34-37 (SEQ ID NO: 250) in normal and cancerous ovary tissues.

FIG. 53 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_seg33junc34-37 (SEQ ID NO: 250) in various normal tissues.

FIG. 54 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc20-22seg30F6R6 (SEQ ID NO: 253) in normal and cancerous ovary tissues.

FIG. 55 presents a histogram showing expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc20-22seg30F6R6 (SEQ ID NO: 253) in various normal tissues.

FIGS. 56A-56J present the nucleotide sequences of the recombinant full length_EGFP ORFs: the gene-specific sequence corresponding to the candidate's full length sequence is marked in bold, the EGFP sequence is in unbold italic and known SNPs/silent mutations are underlined. FIG. 56A presents the full length_EGFP ORF nucleic acid sequence of FXYD3_T0_P0_EGFP DNA (996bp) (SEQ ID NO:77); FIG. 56B presents the full length_EGFP ORF nucleic acid sequence of FXYD3_T25_P14_EGFP DNA (1083bp) (SEQ ID NO:78); FIG. 56C presents the full length_EGFP ORF nucleic acid sequence of AI216611_T0_P0_EGFP DNA (1371bp) (SEQ ID NO:79); FIG. 56D presents the full length_EGFP ORF nucleic acid sequence of AI216611_T1_P1_EGFP DNA (1332bp) (SEQ ID NO:80); FIG. 56E presents the full length_EGFP ORF nucleic acid sequence of C1ORF32_T8_P8_EGFP DNA (1533bp) (SEQ ID NO:81); FIG. 56F presents the full length_EGFP ORF nucleic acid sequence of LOC253012_T4_P5_EGFP DNA (2085bp) (SEQ ID NO:82); FIG. 56G presents the full length_EGFP ORF nucleic acid sequence of ILDR1_T0_P3_EGFP DNA (2373bp) (SEQ ID NO:83); FIG. 56H presents the full length_EGFP ORF nucleic acid sequence of ILDR1_T2_P5_EGFP DNA (2241bp) (SEQ ID NO:84); FIG. 56I presents the full length_EGFP ORF nucleic acid sequence of VSIG1_T6_P5_EGFP DNA (2082bp) (SEQ ID NO:85); FIG. 56J presents the full length_EGFP ORF nucleic acid sequence of VSIG1_T5_P4_EGFP DNA (2004bp) (SEQ ID NO:86).

FIGS. 57A-57J present the sequences of the full length_EGFP fusion proteins of the invention. The candidate's specific sequence corresponding to the full length sequence of the protein is marked in bold, the EGFP sequence is in unbold italic and amino acids modified due to known SNPs are underlined. FIG. 57A presents the full length_EGFP ORF amino acid sequence of FXYD3_P0_EGFP protein (331aa) (SEQ ID NO:87); FIG. 57B presents the full length_EGFP ORF amino acid sequence of FXYD3_P14_EGFP protein (360aa) (SEQ ID NO:88); FIG. 57C presents the full length_EGFP ORF amino acid sequence of AI216611_P0_EGFP protein (456aa) (SEQ ID NO:89); FIG. 57D presents the full length_EGFP ORF amino acid sequence of AI216611_P1_EGFP protein (443aa) (SEQ ID NO:90); FIG. 57E presents the full length_EGFP ORF amino acid sequence of C1ORF32_P8_EGFP protein (510aa) (SEQ ID NO:91); FIG. 57F presents the full length_EGFP ORF amino acid sequence of LOC253012_P5_EGFP protein (694aa) (SEQ ID NO:92); FIG. 57G presents the full length_EGFP ORF amino acid sequence of ILDR1_P3_EGFP protein (790aa) (SEQ ID NO:93); FIG. 57H presents the full length_EGFP ORF amino acid sequence of ILDR1_P5_EGFP protein (746aa) (SEQ ID NO:94); FIG. 57I presents the full length_EGFP ORF amino acid sequence of VSIG1_P5_EGFP protein (693aa) (SEQ ID NO:95); FIG. 57J presents the full length_EGFP ORF amino acid sequence of VSIG1_P4_EGFP protein (667aa) (SEQ ID NO:96).
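As a consistency check on the lengths quoted for FIGS. 56A-56J and 57A-57J, the following sketch compares each ORF length in base pairs with the corresponding fusion protein length in amino acids. It assumes that each ORF length includes exactly one terminal stop codon, which the figure descriptions do not state explicitly; the dictionary keys and function name are illustrative labels only.

```python
# (ORF length in bp from FIGS. 56A-56J, protein length in aa from FIGS. 57A-57J)
FULL_LENGTH_EGFP = {
    "FXYD3_T0_P0": (996, 331),
    "FXYD3_T25_P14": (1083, 360),
    "AI216611_T0_P0": (1371, 456),
    "AI216611_T1_P1": (1332, 443),
    "C1ORF32_T8_P8": (1533, 510),
    "LOC253012_T4_P5": (2085, 694),
    "ILDR1_T0_P3": (2373, 790),
    "ILDR1_T2_P5": (2241, 746),
    "VSIG1_T6_P5": (2082, 693),
    "VSIG1_T5_P4": (2004, 667),
}


def protein_length_from_orf(orf_bp):
    """Return the encoded protein length for an ORF of orf_bp nucleotides.

    Assumption: the ORF length counts one stop codon, so the protein
    has (orf_bp / 3) - 1 residues.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3 - 1


# Collect any construct whose quoted lengths disagree under the assumption.
mismatches = {
    name: (bp, aa)
    for name, (bp, aa) in FULL_LENGTH_EGFP.items()
    if protein_length_from_orf(bp) != aa
}
```

With the values listed above, every pair agrees under this assumption, so `mismatches` is empty.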

FIGS. 58A-58F demonstrate the localization of the proteins of the invention to the cell membrane. FIG. 58A shows cellular localization of AI216611-EGFP_T0_P0 and AI216611-EGFP_T1_P1 proteins. FIG. 58B shows cellular localization of FXYD3-EGFP_T0_P0 and FXYD3-EGFP_T25_P14 proteins. FIG. 58C shows cellular localization of C1ORF32-EGFP_T8_P8 protein. FIG. 58D shows cellular localization of LOC253012-EGFP_T4_P5 protein. FIG. 58E shows cellular localization of VSIG1-EGFP_T6_P5 and VSIG1-EGFP_T5_P4 proteins. FIG. 58F shows cellular localization of ILDR1-EGFP_T0_P3 and ILDR1-EGFP_T2_P5 proteins. All the images were obtained using the 40× objective of the confocal microscope.

FIGS. 59A-59F present the nucleotide sequences of the extracellular domains of the candidate proteins of the invention, fused to mouse Fc: ECD_mFc ORFs. The candidate protein's specific sequence corresponding to the ECD sequence is marked in bold, the TEV cleavage site sequence is underlined, the mFc sequence is in unbold italic and the IL6sp sequence is in bold italic. FIG. 59A shows the FXYD3_T25_P14_ECD_mFc DNA sequence (924bp) (SEQ ID NO:97); FIG. 59B shows the AI216611_T0_P0_ECD_mFc DNA sequence (1170bp) (SEQ ID NO:98); FIG. 59C shows the C1ORF32_T8_P8_ECD_mFc DNA sequence (1287bp) (SEQ ID NO:99); FIG. 59D shows the LOC253012_T4_P5_ECD_mFc DNA sequence (1740bp) (SEQ ID NO:100); FIG. 59E shows the ILDR1_T0_P3_ECD_mFc DNA sequence (1167bp) (SEQ ID NO:101); and FIG. 59F shows the VSIG1_T6_P5_ECD_mFc DNA sequence (1641bp) (SEQ ID NO:102).

FIGS. 60A-60F present the amino acid sequences of the ECD_mFc fusion proteins. The candidate protein's specific sequence corresponding to the ECD sequence is marked in bold, the TEV cleavage site sequence is underlined, the mFc sequence is in unbold italic and the IL6sp sequence is in bold italic. FIG. 60A shows the FXYD3_T25_P14_ECD_mFc amino acid sequence (307aa) (SEQ ID NO:103); FIG. 60B shows the AI216611_T0_P0_ECD_mFc amino acid sequence (389aa) (SEQ ID NO:104); FIG. 60C shows the C1ORF32_T8_P8_ECD_mFc amino acid sequence (428aa) (SEQ ID NO:105); FIG. 60D shows the LOC253012_T4_P5_ECD_mFc amino acid sequence (579aa) (SEQ ID NO:106); FIG. 60E shows the ILDR1_T0_P3_ECD_mFc amino acid sequence (388aa) (SEQ ID NO:107); and FIG. 60F shows the VSIG1_T6_P5_ECD_mFc amino acid sequence (546aa) (SEQ ID NO:108).

FIG. 61 shows the results of a western blot analysis of the expressed FXYD3_ECD_mFc (SEQ ID NO:103), AI216611_ECD_mFc (SEQ ID NO:104), C1ORF32_ECD_mFc (SEQ ID NO:105), LOC253012_ECD_mFc (SEQ ID NO:106), ILDR1_ECD_mFc (SEQ ID NO:107) and VSIG1_ECD_mFc (SEQ ID NO:108). The lanes are as follows: lane 1: molecular weight markers (Amersham, full range rainbow, catalog number RPN800); lane 2: LOC253012_ECD_mFc; lane 3: FXYD3_ECD_mFc; lane 4: AI216611_ECD_mFc; lane 5: C1ORF32_ECD_mFc; lane 6: ILDR1_ECD_mFc; lane 7: VSIG1_ECD_mFc.

FIGS. 62A-62E present the binding of the Fc-fused B7-like proteins ECDs to resting T cells or T cells activated with Con A for different periods of time. FIG. 62A shows the binding results for Fc-fused VSIG1 ECD; FIG. 62B shows the binding results for Fc-fused LOC253012; FIG. 62C shows the binding results for Fc-fused C1ORF32 ECD; and FIG. 62D shows the binding results for Fc-fused AI216611 ECD. FIG. 62E shows the binding results for Fc-fused FXYD3 ECD.

FIG. 63 presents the dose response of the binding of Fc-fused B7-like proteins ECDs to activated T cells. Purified T cells were cultured for 48 hours. Con A was added for the last 24 hours. Cells were then harvested and stained with increasing concentrations (3, 6, 12, 25 and 50 μg/ml) of Fc-fused VSIG1, LOC253012, C1ORF32, AI216611, ILDR1 or FXYD3 ECDs. As negative controls, mouse IgG2a was used at the same concentrations.

FIGS. 64A-64B present the effect of the ECD-Fc fused proteins on T cell proliferation or IL-2 secretion, upon activation with anti-CD3 Ab. FIG. 64A shows the levels of BrdU incorporation. FIG. 64B shows the levels of IL-2 secretion.

FIGS. 65A-1, 65A-2 and 65B illustrate the binding of the Fc-fused ECDs of the VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 to lymphocytes.

FIGS. 66A-1, 66A-2 and 66B illustrate the binding of the Fc-fused ECDs of the ILDR1, C1ORF32 and AI216611 to CD4+ T cells.

FIG. 67 shows the effect of B7-like proteins on T cell activation. “CD3” means CD3 only without the presence of a costimulatory molecule; “CD3+B7.2” means CD3+a known B7 stimulatory control, B7.2; “CD3+B7H4” means CD3 and B7H4, a known B7 inhibitory control; “CD3+B7H3” means CD3 and B7H3, a known B7 stimulatory protein; “CD3+702” means CD3+LOC253012-ECD-Fc fused (SEQ ID NO:106); “CD3+721” means CD3+AI216611-ECD-Fc fused (SEQ ID NO:104); “CD3+754” means CD3+C1ORF32-ECD-Fc fused (SEQ ID NO:105); “CD3+768” means CD3+VSIG1-ECD-Fc fused (SEQ ID NO:108); “CD3+770” means CD3+ILDR1-ECD-Fc fused (SEQ ID NO:107); “CD3+789” means CD3+FXYD3-ECD-Fc fused (SEQ ID NO:103). FIGS. 67A, 67B and 67C present three different experiments with three different donors.

FIG. 68A presents FACS results of binding of ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108) to resting B cells.

FIG. 68B presents FACS results of binding of ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108) to activated B cells.

FIG. 68C presents FACS results of binding of ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108) to B lymphoma cell lines.

FIG. 69 shows BIACORE results demonstrating interaction between AI216611 and B7H4.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to any one of the antigens referred to as VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, and its corresponding nucleic acid sequence, and portions and variants thereof and conjugates containing same, and the use thereof as a therapeutic or diagnostic target. In particular the invention uses these antigens and discrete portions thereof as drug targets for therapeutic small molecules, peptides, antibodies, antisense RNAs, siRNAs, ribozymes, and the like. More particularly the invention relates to diagnostic and therapeutic polyclonal and monoclonal antibodies and fragments thereof that bind VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 and portions and variants thereof, especially those that target the ectodomain or portions or variants thereof, particularly human or chimeric monoclonal antibodies that bind specifically to the antigen AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), and variants thereof, including those that promote or inhibit activities elicited by VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, including those relating to modulation of immune costimulation, e.g. B7 related costimulation.

In certain embodiments, the antibodies of the invention are derived from particular heavy and light chain germline sequences and/or comprise particular structural features such as CDR regions comprising particular amino acid sequences. The invention provides isolated antibodies, methods of making such antibodies, immunoconjugates and bispecific molecules comprising such antibodies and pharmaceutical and diagnostic compositions containing the antibodies, immunoconjugates or bispecific molecules of the invention.

The invention also relates to in vitro and in vivo methods of using the antibodies and fragments, to detect VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, as well as to treat diseases associated with expression of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3, such as malignancies that differentially express VSIG1. The invention further relates to methods of using the antibodies and fragments, specific for VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 to treat autoimmune disorders and transplant and graft versus host disease. Accordingly, the invention also provides methods of using the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, anti-FXYD3 antibodies of the invention and other drugs that modulate VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 to treat malignancies for example, in the treatment of lung cancer, ovarian cancer, colon cancer, non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic. The invention further provides methods of using the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, anti-FXYD3 antibodies of the invention and other drugs that modulate VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 to treat non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease. Preferably these antibodies will possess ADCC or CDC activity against target cells such as cancer cells.

Also, the invention relates to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 antigen and portions thereof including soluble polypeptide conjugates containing the ectodomain of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 and/or the corresponding DNAs or vectors or cells expressing same for use in immunotherapy. Further, the invention provides vectors and cells containing same, and the use thereof for the expression of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 antigen, as well as discrete portions and variants thereof. Also, the invention provides non-antibody based VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 modulatory agents such as peptides, antisense RNAs, siRNAs, carbohydrates, and other small molecules that specifically bind and/or modulate a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 related activity.

In order that the present invention may be more readily understood, certain terms are first defined. Additional definitions are set forth throughout the detailed description.

The term VSIG1 refers to the protein encoded by any one of the AI581519_T0 (SEQ ID NO:1), AI581519_T1 (SEQ ID NO:2), AI581519_T2 (SEQ ID NO:3), AI581519_T3 (SEQ ID NO:4), AI581519_T4 (SEQ ID NO:5), AI581519_T5 (SEQ ID NO:6), AI581519_T6 (SEQ ID NO:7), AI581519_T8 (SEQ ID NO:8), AI581519_T10 (SEQ ID NO:9), AI581519_T11 (SEQ ID NO:10) transcripts reported herein, particularly to proteins as set forth in any one of AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), and variants thereof that are differentially expressed e.g., in cancers such as lung cancer and ovarian cancer, wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

The term ILDR1 refers to the protein encoded by any one of the AA424839_T0 (SEQ ID NO:17), AA424839_T2 (SEQ ID NO:18), AA424839_T4 (SEQ ID NO:19), AA424839_1_T7 (SEQ ID NO:20) transcripts reported herein, particularly to proteins as set forth in any one of AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), AA424839_1_P11 (SEQ ID NO:24), and variants thereof that are differentially expressed e.g., in cancer such as colon cancer and ovarian cancer wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

The term LOC253012 refers to the protein encoded by any one of the H68654_1_T0 (SEQ ID NO:25), H68654_1_T4 (SEQ ID NO:26), H68654_1_T5 (SEQ ID NO:27), H68654_1_T8 (SEQ ID NO:28), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), H68654_1_T20 (SEQ ID NO:34) transcripts reported herein, particularly to proteins as set forth in any one of H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), and variants thereof that are differentially expressed e.g., in cancers such as lung cancer, especially small cell lung carcinoma, wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

The term AI216611 refers to the protein encoded by any one of the AI216611_T0 (SEQ ID NO:41), AI216611_T1 (SEQ ID NO:42) transcripts reported herein, particularly to proteins as set forth in any one of AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), and variants thereof that are differentially expressed e.g., in cancers such as non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, and brain and wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

The term C1ORF32 refers to the protein encoded by any one of the H19011_1_T8 (SEQ ID NO:45), H19011_1_T9 (SEQ ID NO:46) transcripts reported herein, particularly to proteins as set forth in any one of H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), and variants thereof that are differentially expressed e.g., in cancers such as lung cancer, particularly lung small cell carcinoma, wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

The term FXYD3 refers to the protein encoded by any one of the R31375_T0 (SEQ ID NO:51); R31375_T1 (SEQ ID NO:52); R31375_T10 (SEQ ID NO:61); R31375_T11 (SEQ ID NO:62); R31375_T12 (SEQ ID NO:63); R31375_T13 (SEQ ID NO:64); R31375_T2 (SEQ ID NO:53); R31375_T3 (SEQ ID NO:54); R31375_T4 (SEQ ID NO:55); R31375_T5 (SEQ ID NO:56); R31375_T6 (SEQ ID NO:57); R31375_T7 (SEQ ID NO:58); R31375_T8 (SEQ ID NO:59); R31375_T9 (SEQ ID NO:60); R31375_T19 (SEQ ID NO:65); R31375_T25 (SEQ ID NO:66); R31375_T26 (SEQ ID NO:67); R31375_T29 (SEQ ID NO:68); R31375_T39 (SEQ ID NO:69) transcripts reported herein, particularly to proteins as set forth in any one of R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73), R31375_P33 (SEQ ID NO:74), and variants thereof that are differentially expressed e.g., in cancers such as ovarian cancer as well as other non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic.

Preferably such VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 variants will possess at least 80% sequence identity therewith, more preferably at least 90% sequence identity therewith and even more preferably at least 95% sequence identity therewith.
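The identity thresholds above can be made concrete with a minimal sketch. It assumes the two sequences have already been aligned to equal length (gaps written as "-") and that percent identity is the number of identical columns divided by the number of alignment columns; other conventions for the denominator exist, and this passage does not fix one. The function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences.

    Assumption: identity = identical columns / alignment columns,
    ignoring columns where both sequences have a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def identity_tier(pct):
    """Map a percent identity onto the preference tiers named in the text."""
    if pct >= 95:
        return "at least 95%"
    if pct >= 90:
        return "at least 90%"
    if pct >= 80:
        return "at least 80%"
    return "below 80%"


# Toy example: one substitution in a ten-residue alignment.
example_pct = percent_identity("QVTIPDGFVN", "QVTIPDGFVA")
example_tier = identity_tier(example_pct)
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch only shows how a threshold such as 80%, 90% or 95% is applied once an alignment is available.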

Any one of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 proteins based on its domain structure is predicted to be an immune costimulatory protein, e.g., a B7 protein family member that is involved in B7 immune co-stimulation including for example T cell responses elicited against cancer cells and that elicit effects on immunity such as triggering of autoimmune effects.

The term “soluble ectodomain (ECD)” or “ectodomain” of VSIG1 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of the VSIG1 protein):

>AI581519_P3 (SEQ ID NO: 11) residues 23 to 234 (SEQ ID NO: 138)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISI YFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNPP DFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVY YWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSSC EIDLTSSHPEVG
>AI581519_P4 (SEQ ID NO: 12) residues 23 to 270 (SEQ ID NO: 139)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISH SSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSEIYFSQGGQAVAIGQF KDRITGSNDPGNASITISHMQPADSGIYICDVNNPPDFLGQNQGILNVSV LVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVYYWHKLEGRDIVPVK ENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSSCEIDLTSSHPEVG
>AI581519_P5 (SEQ ID NO: 13) residues 23 to 296 (SEQ ID NO: 140)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISH SSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSESPWEEGKWPDVEAVK GTLDGQQAELQIYFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPAD SGIYICDVNNPPDFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLS CLSALGTPSPVYYWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQ CTAINRLGNSSCEIDLTSSHPEVG
>AI581519_P7 (SEQ ID NO: 14) residues 23 to 193 (SEQ ID NO: 141)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISI YFSQGGQAVAIGQFKDRITGSNDPVKPSKPLCSVQGRPETGHTISLSCLS ALGTPSPVYYWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTA INRLGNSSCEIDLTSSHPEVG
>AI581519_P9 (SEQ ID NO: 15) residues 23 to 203 (SEQ ID NO: 142)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISI YFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNPP DFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVY YWHKLEGRDIVPVKENFTNHRDFGHWKSDKF
>AI581519_P10 (SEQ ID NO: 16) residues 23 to 231 (SEQ ID NO: 143)
QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISI YFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNPP DFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVY YWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSSC EIDLTSSRQ
>AI581519_P5 (SEQ ID NO: 13) residues 26 to 293 (SEQ ID NO: 302)
IPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISHSSC LSTEGMEEKAVSQCLKMTHARDARGRCSVVTSESPWEEGKWPDVEAVKGT LDGQQAELQIYFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSG IYICDVNNPPDFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCL SALGTPSPVYYWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCT AINRLGNSSCEIDLTSSHP,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.
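The residue ranges quoted for the VSIG1 ectodomains appear to be 1-based and inclusive: the fragment listed for AI581519_P3 residues 23 to 234 is 212 residues long, which equals 234 - 23 + 1. The sketch below shows how such a range maps onto a string slice; the function names are hypothetical, and the fragment is copied from the listing above with the spacing removed.

```python
def residue_range_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def extract_residues(full_sequence, start, end):
    """Return residues start..end (1-based, inclusive) of full_sequence."""
    if not (1 <= start <= end <= len(full_sequence)):
        raise ValueError("residue range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return full_sequence[start - 1:end]


# ECD fragment listed for AI581519_P3, residues 23 to 234.
P3_ECD = (
    "QVTIPDGFVNVTVGSNVTLICIYTTTVASREQLSIQWSFFHKKEMEPISI"
    "YFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNPP"
    "DFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVY"
    "YWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSSC"
    "EIDLTSSHPEVG"
)

# True when the listed fragment length matches the quoted range.
range_is_consistent = len(P3_ECD) == residue_range_length(23, 234)
```

Given the full-length protein sequence, `extract_residues(full_sequence, 23, 234)` would be expected to return this fragment, assuming the numbering starts at the initiator methionine.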

The term “soluble ectodomain (ECD)” or “ectodomain” of ILDR1 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of the ILDR1 protein):

residues 24-105 of AA424839_1_P11 (SEQ ID NO: 24): SEQ ID NO: 296
ALSLGQDPSNDCNDNQREVRIVAQRRGQNEPVLGVDYRQRKITIQNRA DLVINEVMWWDHGVYYCTIEAPGDTSGDPDKEVK;
residues 24-162 of AA424839_P3 (SEQ ID NO: 22) and AA424839_P5 (SEQ ID NO: 21): SEQ ID NO: 75
LLVTVQHTERYVTLFASIILKCDYTTSAQLQDVVVTWRFKSFCKDPIFDY YSASYQAALSLGQDPSNDCNDNQREVRIVAQRRGQNEPVLGVDYRQRKIT IQNRADLVINEVMVVVVDHGVYYCTIEAPGDTSGDPDKEVK;
residues 24-457 of AA424839_P7 (SEQ ID NO: 23): SEQ ID NO: 76
LLVTVQHTERYVTLFASIILKCDYTTSAQLQDVVVTWRFKSFCKDPIFDY YSASYQAALSLGQDPSNDCNDNQREVRIVAQRRGQNEPVLGVDYRQRKIT IQNPLARHRYMKQAQALGPQMMGKPLYWGADRSSQVSSYPMHPLLQRDLS LPSSLPQMPMTQTTNQPPIANGVLEYLEKELRNLNLAQPLPPDLKGRFGH PCSMLSSLGSEVVERRIIHLPPLIRDLSSSRRTSDSLHQQWLTPIPSRPW DLREGRSHHHYPDFHQELQDRGPKSWALERRELDPSWSGRHRSSRLNGSP IHWSDRDSLSDVPSSSEARWRPSHPPFRSRCQERPRRPSPRESTQRHGRR RRHRSYSPPLPSGLSSWSSEEDKERQPQSWRAHRRGSHSPHWPEEKPPSY RSLDITPGKNSRKKGSVERRSEKDSSHSGRSVVI;
residues 50-160 of AA424839_P3 (SEQ ID NO: 22): SEQ ID NO: 301
AQLQDVVVTWRFKSFCKDPIFDYYSASYQAALSLGQDPSNDCNDNQRE VRIVAQRRGQNEPVLGVDYRQRKITIQNRADLVINEVMWWDHGVYYCT IEAPGDTSGDPDKE,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

The term “soluble ectodomain (ECD)” or “ectodomain” of LOC253012 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of LOC253012 protein):

H68654_1_P2 (SEQ ID NO: 35) residues 38-349: (SEQ ID NO: 144) SHTVHGVRGQALYLPVHYGFHTPASDIQIIVVLFERPHTMPKYLLGSVNK SVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQGNGTLSASQ KIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLAYQWLKNG RPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMESDIIMPI IYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYSWIRRT DNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTVIITS VGLEKLAQKGKSL; H68654_1_P5 (SEQ ID NO: 36), H68654_1_P7 (SEQ ID NO: 37), H68654_1_P12 (SEQ ID NO: 38), H68654_ 1_P13 (SEQ ID NO: 39), H68654_1_P14 (SEQ ID NO: 40) residues 19-337: (SEQ ID NO: 145) GLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMPKYL LGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQGNG TLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLAY QWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMES DIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTY SWIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHF TVIITSVGLEKLAQKGKSL, H68654_1_P5 (SEQ ID NO: 36) residues 1-335 (SEQ ID NO: 300): MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPAS DIQIIWLFERPHTMPKYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPL QFPDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEY VGNMTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKE DIGNYSCLVRNPVSEMESDIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDL GEAILFDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMD YVCCAYNNITGRQDETHFTVIITSVGLEKLAQKGK,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

The term “soluble ectodomain (ECD)” or “ectodomain” of AI216611 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of AI216611 protein):

>AI216611_P0 (SEQ ID NO: 43) From 29 to 147 (SEQ ID NO: 146) LQSQGVSLYIPQATINATVKEDILLSVEYSCHGVPTIEWTYSSNWGTQKI VEWKPGTQANISQSHKDRVCTFDNGSIQLFSVGVRDSGYYVITVTERLGS SQFGTIVLHVSEILYEDLH, >AI216611_P0 (SEQ ID NO: 43) From 1 to 145 (SEQ ID NO: 298) MRPLPSGRRKTRGISLGLFALCLAAARCLQSQGVSLYIPQATINATVKED ILLSVEYSCHGVPTIEWTYSSNWGTQKIVEWKPGTQANISQSHKDRVCTF DNGSIQLFSVGVRDSGYYVITVTERLGSSQFGTIVLHVSEILYED,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

The term “soluble ectodomain (ECD)” or “ectodomain” of C1ORF32 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of C1ORF32 protein):

>H19011_1_P8 (SEQ ID NO: 48) residues 21 to 186 (SEQ ID NO: 147) LQVTVPDKKKVAMLFQPTVLRCHFSTSSHQPAVVQWKFKSYCQDRMGESL GMSSTRAQSLSKRNLEWDPYLDCLDSRRTVRVVASKQGSTVTLGDFYRGR EITIVHDADLQIGKLMWGDSGLYYCIITTPDDLEGKNEGSLGLLVLGRTG LLADLLPSFAVEIMPE >H19011_1_P9 (SEQ ID NO: 50) residues 21 to 169 (SEQ ID NO: 148) LQVTVPDKKKVAMLFQPTVLRCHFSTSSHQPAVVQWKFKSYCQDRMGESL GMSSTRAQSLSKRNLEWDPYLDCLDSRRTVRVVASKQGSTVTLGDFYRGR EITIVHDADLQIGKLMWGDSGLYYCIITTPDDLEGKNEGSLGLLVLEWV, >H19011_1_P8 (SEQ ID NO: 48) residues 1 to 184 (SEQ ID NO: 299) MDRVLLRWISLFWLTAMVEGLQVTVPDKKKVAMLFQPTVLRCHFSTSSHQ PAVVQWKFKSYCQDRMGESLGMSSTRAQSLSKRNLEWDPYLDCLDSRRTV RVVASKQGSTVTLGDFYRGREITIVHDADLQIGKLMWGDSGLYYCIITTP DDLEGKNEDSVELLVLGRTGLLADLLPSFAVEIM,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.

The term “soluble ectodomain (ECD)” or “ectodomain” of FXYD3 refers to the polypeptide sequences below or the corresponding nucleic acid sequences (which do not comprise the signal peptide and the TM of FXYD3 protein):

>R31375_P0 (SEQ ID NO: 70); R31375_P31 (SEQ ID NO: 73) From 21 to 36 (SEQ ID NO: 149) NDLEDKNSPFYYDWHS >R31375_P14 (SEQ ID NO: 72) From 21 to 65 (SEQ ID NO: 150) NDLEDKNSPFYYGAPYIFVKRMGGQMKRTQAGTEVPSTFLLDWHS >R31375_P33 (SEQ ID NO: 74) From 21 to 25 (SEQ ID NO: 151) NDLED, >R31375_P14 (SEQ ID NO: 72) From 1 to 63 (SEQ ID NO: 297) MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYGAPYIFVKRMGGQMKR TQAGTEVPSTFLLDW,

and variants thereof possessing at least 80% sequence identity, more preferably at least 90% sequence identity therewith and even more preferably at least 95, 96, 97, 98 or 99% sequence identity therewith.
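By way of non-limiting illustration only, the residue ranges and percent sequence identity thresholds recited above may be sketched as follows. The function names `extract_residues` and `percent_identity` are hypothetical and introduced here for illustration. The sketch assumes 1-based, inclusive residue numbering and assumes that the two sequences being compared are already aligned and of equal length, without gaps; the present description does not prescribe a particular alignment method, and a practical comparison would ordinarily rely on an established alignment tool.

```python
# Illustrative sketch only: residue-range extraction and ungapped percent identity.
# Assumptions (not taken from the specification): 1-based inclusive numbering,
# pre-aligned sequences of equal length, no gaps.

def extract_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    return sequence[start - 1:end]


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions with identical residues in two equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Residues 1 to 63 of R31375_P14 as recited above (SEQ ID NO: 297).
R31375_P14_1_63 = (
    "MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYGAPYIFVKRMGGQMKR"
    "TQAGTEVPSTFLLDW"
)

# Portion following the first 20 residues (hypothetical example range 21-63).
ecd_fragment = extract_residues(R31375_P14_1_63, 21, 63)

# Hypothetical variant carrying a single substitution at the first position.
variant = "A" + ecd_fragment[1:]

identity = percent_identity(ecd_fragment, variant)
meets_95_percent = identity >= 95.0
```

In this sketch a single substitution in a 43-residue fragment leaves the variant above the 95% threshold; shorter fragments tolerate proportionally fewer substitutions at a given threshold.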

The term “immune response” refers to the action of, for example, lymphocytes, antigen presenting cells, phagocytic cells, granulocytes, and soluble macromolecules produced by the above cells or cells produced by the liver or spleen (including antibodies, cytokines, and complement) that results in selective damage to, destruction of, or elimination from the human body of invading pathogens, cells or tissues infected with pathogens, cancerous cells, or, in cases of autoimmunity or pathological inflammation, normal human cells or tissues.

A “signal transduction pathway” refers to the biochemical relationship between a variety of signal transduction molecules that play a role in the transmission of a signal from one portion of a cell to another portion of a cell.

As used herein, the phrase “cell surface receptor” includes, for example, molecules and complexes of molecules capable of receiving a signal and the transmission of such a signal across the plasma membrane of a cell.

The term “antibody” as referred to herein includes whole polyclonal and monoclonal antibodies and any antigen binding fragment (i.e., “antigen-binding portion”) or single chains thereof. An “antibody” refers to a glycoprotein comprising at least two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds, or an antigen binding portion thereof. Each heavy chain is comprised of a heavy chain variable region (abbreviated herein as VH) and a heavy chain constant region. The heavy chain constant region is comprised of three domains, CH1, CH2 and CH3. Each light chain is comprised of a light chain variable region (abbreviated herein as VL) and a light chain constant region. The light chain constant region is comprised of one domain, CL. The VH and VL regions can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each VH and VL is composed of three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. The variable regions of the heavy and light chains contain a binding domain that interacts with an antigen. The constant regions of the antibodies may mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g., effector cells) and the first component (C1q) of the classical complement system.

The term “antigen-binding portion” of an antibody (or simply “antibody portion”), as used herein, refers to one or more fragments of an antibody that retain the ability to specifically bind to an antigen (e.g., VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3). It has been shown that the antigen-binding function of an antibody can be performed by fragments of a full-length antibody. Examples of binding fragments encompassed within the term “antigen-binding portion” of an antibody include (i) a Fab fragment, a monovalent fragment consisting of the VL, VH, CL and CH1 domains; (ii) a F(ab′)2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region; (iii) a Fd fragment consisting of the VH and CH1 domains; (iv) a Fv fragment consisting of the VL and VH domains of a single arm of an antibody; (v) a dAb fragment (Ward et al., (1989) Nature 341:544-546), which consists of a VH domain; and (vi) an isolated complementarity determining region (CDR). Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by separate genes, they can be joined, using recombinant methods, by a synthetic linker that enables them to be made as a single protein chain in which the VL and VH regions pair to form monovalent molecules (known as single chain Fv (scFv); see e.g., Bird et al. (1988) Science 242:423-426; and Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85:5879-5883). Such single chain antibodies are also intended to be encompassed within the term “antigen-binding portion” of an antibody. These antibody fragments are obtained using conventional techniques known to those with skill in the art, and the fragments are screened for utility in the same manner as are intact antibodies.

An “isolated antibody”, as used herein, is intended to refer to an antibody that is substantially free of other antibodies having different antigenic specificities (e.g., an isolated antibody that specifically binds VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 is substantially free of antibodies that specifically bind antigens other than VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, respectively). An isolated antibody that specifically binds VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 may, however, have cross-reactivity to other antigens, such as VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 molecules from other species, respectively. Moreover, an isolated antibody may be substantially free of other cellular material and/or chemicals.

The terms “monoclonal antibody” or “monoclonal antibody composition” as used herein refer to a preparation of antibody molecules of single molecular composition. A monoclonal antibody composition displays a single binding specificity and affinity for a particular epitope.

The term “human antibody”, as used herein, is intended to include antibodies having variable regions in which both the framework and CDR regions are derived from human germline immunoglobulin sequences. Furthermore, if the antibody contains a constant region, the constant region also is derived from human germline immunoglobulin sequences. The human antibodies of the invention may include amino acid residues not encoded by human germline immunoglobulin sequences (e.g., mutations introduced by random or site-specific mutagenesis in vitro or by somatic mutation in vivo). However, the term “human antibody”, as used herein, is not intended to include antibodies in which CDR sequences derived from the germline of another mammalian species, such as a mouse, have been grafted onto human framework sequences.

The term “human monoclonal antibody” refers to antibodies displaying a single binding specificity which have variable regions in which both the framework and CDR regions are derived from human germline immunoglobulin sequences. In one embodiment, the human monoclonal antibodies are produced by a hybridoma which includes a B cell obtained from a transgenic nonhuman animal, e.g., a transgenic mouse, having a genome comprising a human heavy chain transgene and a light chain transgene fused to an immortalized cell.

The term “recombinant human antibody”, as used herein, includes all human antibodies that are prepared, expressed, created or isolated by recombinant means, such as (a) antibodies isolated from an animal (e.g., a mouse) that is transgenic or transchromosomal for human immunoglobulin genes or a hybridoma prepared therefrom (described further below), (b) antibodies isolated from a host cell transformed to express the human antibody, e.g., from a transfectoma, (c) antibodies isolated from a recombinant, combinatorial human antibody library, and (d) antibodies prepared, expressed, created or isolated by any other means that involve splicing of human immunoglobulin gene sequences to other DNA sequences. Such recombinant human antibodies have variable regions in which the framework and CDR regions are derived from human germline immunoglobulin sequences. In certain embodiments, however, such recombinant human antibodies can be subjected to in vitro mutagenesis (or, when an animal transgenic for human Ig sequences is used, in vivo somatic mutagenesis) and thus the amino acid sequences of the VH and VL regions of the recombinant antibodies are sequences that, while derived from and related to human germline VH and VL sequences, may not naturally exist within the human antibody germline repertoire in vivo.

As used herein, “isotype” refers to the antibody class (e.g., IgM or IgG1) that is encoded by the heavy chain constant region genes.

The phrases “an antibody recognizing an antigen” and “an antibody specific for an antigen” are used interchangeably herein with the term “an antibody which binds specifically to an antigen.”

As used herein, an antibody that “specifically binds to human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3” is intended to refer to an antibody that binds to human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, respectively, preferably one with a KD of 5×10⁻⁸ M or less, more preferably 3×10⁻⁸ M or less, and even more preferably 1×10⁻⁹ M or less.

The term “K-assoc” or “Ka”, as used herein, is intended to refer to the association rate of a particular antibody-antigen interaction, whereas the term “Kdiss” or “Kd,” as used herein, is intended to refer to the dissociation rate of a particular antibody-antigen interaction. The term “KD”, as used herein, is intended to refer to the dissociation constant, which is obtained from the ratio of Kd to Ka (i.e., Kd/Ka) and is expressed as a molar concentration (M). KD values for antibodies can be determined using methods well established in the art. A preferred method for determining the KD of an antibody is by using surface plasmon resonance, preferably using a biosensor system such as a Biacore® system.
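As a non-limiting worked illustration of the ratio Kd/Ka, the following sketch computes a dissociation constant from a pair of rate values. The function name `dissociation_constant` and the numerical rate values are hypothetical and are not measured data. The sketch assumes that Ka is supplied in units of M⁻¹s⁻¹ and Kd in units of s⁻¹, so that the ratio is expressed as a molar concentration.

```python
# Illustrative sketch only: KD = Kd / Ka.
# Assumed units (not stated in the specification): Ka in 1/(M*s), Kd in 1/s.

def dissociation_constant(k_assoc: float, k_diss: float) -> float:
    """Return KD in molar units from an association rate and a dissociation rate."""
    if k_assoc <= 0:
        raise ValueError("association rate must be positive")
    if k_diss < 0:
        raise ValueError("dissociation rate cannot be negative")
    return k_diss / k_assoc


# Hypothetical rate values chosen only to make the arithmetic easy to follow.
example_kd_molar = dissociation_constant(k_assoc=1.0e5, k_diss=1.0e-4)
# 1.0e-4 / 1.0e5 is approximately 1e-9 M (about 1 nM)
```

A slower dissociation rate or a faster association rate each lowers KD, which corresponds to tighter binding.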

As used herein, the term “high affinity” for an IgG antibody refers to an antibody having a KD of 10⁻⁸ M or less, more preferably 10⁻⁹ M or less and even more preferably 10⁻¹⁰ M or less for a target antigen. However, “high affinity” binding can vary for other antibody isotypes. For example, “high affinity” binding for an IgM isotype refers to an antibody having a KD of 10⁻⁷ M or less, more preferably 10⁻⁸ M or less.
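The isotype-dependent thresholds stated above may be sketched, purely for illustration, as a simple lookup. The names `HIGH_AFFINITY_KD_MOLAR` and `is_high_affinity` are hypothetical; only the broadest IgG and IgM thresholds recited above are encoded, and no threshold is assumed for any other isotype.

```python
# Illustrative sketch only: broadest "high affinity" KD thresholds by isotype.
# Only IgG and IgM are covered, as these are the isotypes given in the text.

HIGH_AFFINITY_KD_MOLAR = {
    "IgG": 1e-8,  # KD of 10^-8 M or less
    "IgM": 1e-7,  # KD of 10^-7 M or less
}


def is_high_affinity(kd_molar: float, isotype: str) -> bool:
    """Return True if KD meets the broadest high-affinity threshold for the isotype."""
    if kd_molar <= 0:
        raise ValueError("KD must be a positive molar concentration")
    if isotype not in HIGH_AFFINITY_KD_MOLAR:
        raise KeyError(f"no threshold defined for isotype {isotype!r}")
    return kd_molar <= HIGH_AFFINITY_KD_MOLAR[isotype]


# The same hypothetical KD can qualify for one isotype and not the other.
igg_result = is_high_affinity(5e-8, "IgG")
igm_result = is_high_affinity(5e-8, "IgM")
```

The usage lines show that a hypothetical KD of 5×10⁻⁸ M falls outside the IgG threshold but within the IgM threshold.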

As used herein, the term “subject” includes any human or nonhuman animal. The term “nonhuman animal” includes all vertebrates, e.g., mammals and non-mammals, such as nonhuman primates, sheep, dogs, cats, horses, cows, chickens, amphibians, reptiles, etc.

As used herein, the term “tail” refers to a peptide sequence at the end of an amino acid sequence that is unique to a splice variant according to the present invention. Therefore, a splice variant having such a tail may optionally be considered as a chimera, in that at least a first portion of the splice variant is typically highly homologous (often 100% identical) to a portion of the corresponding known protein, while at least a second portion of the variant comprises the tail.

As used herein, the term “head” refers to a peptide sequence at the beginning of an amino acid sequence that is unique to a splice variant according to the present invention. Therefore, a splice variant having such a head may optionally be considered as a chimera, in that at least a first portion of the splice variant comprises the head, while at least a second portion is typically highly homologous (often 100% identical) to a portion of the corresponding known protein.

As used herein, the term “an edge portion” refers to a connection between two portions of a splice variant according to the present invention that were not joined in the wild type or known protein. An edge may optionally arise due to a join between the above “known protein” portion of a variant and the tail, for example, and/or may occur if an internal portion of the wild type sequence is no longer present, such that two portions of the sequence are now contiguous in the splice variant that were not contiguous in the known protein. A “bridge” may optionally be an edge portion as described above, but may also include a join between a head and a “known protein” portion of a variant, or a join between a tail and a “known protein” portion of a variant, or a join between an insertion and a “known protein” portion of a variant.

In some embodiments, a bridge between a tail or a head or a unique insertion, and a “known protein” portion of a variant, comprises at least about 10 amino acids, or in some embodiments at least about 20 amino acids, or in some embodiments at least about 30 amino acids, or in some embodiments at least about 40 amino acids, in which at least one amino acid is from the tail/head/insertion and at least one amino acid is from the “known protein” portion of a variant. In some embodiments, the bridge may comprise any number of amino acids from about 10 to about 40 amino acids (for example, 10, 11, 12, 13 . . . 37, 38, 39, 40 amino acids in length, or any number in between).

It should be noted that a bridge cannot be extended beyond the length of the sequence in either direction, and it should be assumed that every bridge description is to be read in such manner that the bridge length does not extend beyond the sequence itself.

Furthermore, bridges are described with regard to a sliding window in certain contexts below. For example, certain descriptions of the bridges feature the following format: a bridge between two edges (in which a portion of the known protein is not present in the variant) may optionally be described as follows: a bridge portion of CONTIG-NAME_P1 (representing the name of the protein), comprising a polypeptide having a length “n”, wherein n is at least about 10 amino acids in length, optionally at least about 20 amino acids in length, preferably at least about 30 amino acids in length, more preferably at least about 40 amino acids in length and most preferably at least about 50 amino acids in length, wherein at least two amino acids comprise XX (2 amino acids in the center of the bridge, one from each end of the edge), having a structure as follows (numbering according to the sequence of CONTIG-NAME_P1): a sequence starting from any of amino acid numbers 49−x to 49 (for example); and ending at any of amino acid numbers 50+((n−2)−x) (for example), in which x varies from 0 to n−2. In this example, it should also be read as including bridges in which n is any number of amino acids between 10-50 amino acids in length. Furthermore, the bridge polypeptide cannot extend beyond the sequence, so it should be read such that 49−x (for example) is not less than 1, nor 50+((n−2)−x) (for example) greater than the total sequence length.
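By way of illustration only, the sliding window described above may be enumerated as follows. The function name `bridge_windows` and its parameters are hypothetical. The sketch assumes 1-based, inclusive residue numbering and takes the two central amino acids of the bridge to be at positions `left_edge` and `left_edge + 1` (49 and 50 in the example above); windows that would extend beyond the sequence are omitted, consistent with the limitation stated above.

```python
# Illustrative sketch only: enumerate all bridge windows of length n that span
# the junction between residues left_edge and left_edge + 1 (1-based numbering).

def bridge_windows(seq_len: int, left_edge: int, n: int) -> list[tuple[int, int]]:
    """Return (start, end) pairs, 1-based inclusive, for bridges of length n."""
    if n < 2:
        raise ValueError("a bridge must contain both junction residues")
    if not 1 <= left_edge < seq_len:
        raise ValueError("junction must lie inside the sequence")
    right_edge = left_edge + 1
    windows = []
    for x in range(n - 1):  # x varies from 0 to n-2
        start = left_edge - x               # e.g. 49 - x
        end = right_edge + ((n - 2) - x)    # e.g. 50 + ((n - 2) - x)
        # Discard windows that would run past either end of the sequence.
        if start >= 1 and end <= seq_len:
            windows.append((start, end))
    return windows


# Hypothetical example: a 200-residue variant, junction at 49/50, bridge length 10.
example_windows = bridge_windows(seq_len=200, left_edge=49, n=10)
```

Each returned window has length n and contains both junction residues; for n of 10 and a sufficiently long sequence there are n−1, i.e. nine, such windows, running from residues 49-58 to residues 41-50.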

Various aspects of the invention are described in further detail in the following subsections.

Nucleic Acids

The terms “nucleic acid fragment”, “oligonucleotide” and “polynucleotide” are used herein interchangeably to refer to a polymer of nucleic acid residues. A polynucleotide sequence of the present invention refers to a single or double stranded nucleic acid sequence which is isolated and provided in the form of an RNA sequence, a complementary polynucleotide sequence (cDNA), a genomic polynucleotide sequence and/or a composite polynucleotide sequence (e.g., a combination of the above).

Thus, the present invention encompasses nucleic acid sequences described hereinabove; fragments thereof, sequences hybridizable therewith, sequences homologous thereto [e.g., at least 90%, at least 95, 96, 97, 98 or 99% or more identical to the nucleic acid sequences set forth herein], sequences encoding similar polypeptides with different codon usage, altered sequences characterized by mutations, such as deletion, insertion or substitution of one or more nucleotides, either naturally occurring or man induced, either randomly or in a targeted fashion. The present invention also encompasses homologous nucleic acid sequences (i.e., which form a part of a polynucleotide sequence of the present invention), which include sequence regions unique to the polynucleotides of the present invention.

In cases where the polynucleotide sequences of the present invention encode previously unidentified polypeptides, the present invention also encompasses novel polypeptides or portions thereof, which are encoded by the isolated polynucleotide and respective nucleic acid fragments thereof described hereinabove.

Thus, the present invention also encompasses polypeptides encoded by the polynucleotide sequences of the present invention. The present invention also encompasses homologues of these polypeptides; such homologues can be at least 90%, at least 95, 96, 97, 98 or 99% or more homologous to the amino acid sequences set forth below, as can be determined using BlastP software of the National Center of Biotechnology Information (NCBI) using default parameters. Finally, the present invention also encompasses fragments of the above described polypeptides and polypeptides having mutations, such as deletions, insertions or substitutions of one or more amino acids, either naturally occurring or man induced, either randomly or in a targeted fashion.

As mentioned hereinabove, biomolecular sequences of the present invention can be efficiently utilized as tissue or pathological markers and as putative drugs or drug targets for treating or preventing a disease.

Oligonucleotides designed for carrying out the methods of the present invention for any of the sequences provided herein (designed as described above) can be generated according to any oligonucleotide synthesis method known in the art such as enzymatic synthesis or solid phase synthesis. Equipment and reagents for executing solid-phase synthesis are commercially available from, for example, Applied Biosystems. Any other means for such synthesis may also be employed; the actual synthesis of the oligonucleotides is well within the capabilities of one skilled in the art.

Oligonucleotides used according to this aspect of the present invention are those having a length selected from a range of about 10 to about 200 bases, preferably about 15 to about 150 bases, more preferably about 20 to about 100 bases, most preferably about 20 to about 50 bases.

The oligonucleotides of the present invention may comprise heterocyclic nucleosides consisting of purine and pyrimidine bases, bonded in a 3′ to 5′ phosphodiester linkage.

Preferred oligonucleotides are those modified in the backbone, the internucleoside linkages or the bases, as is broadly described hereinunder. Such modifications can oftentimes facilitate oligonucleotide uptake and resistance to intracellular conditions.

Specific examples of preferred oligonucleotides useful according to this aspect of the present invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. Oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone, as disclosed in U.S. Pat. Nos. 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050.

Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkyl phosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms can also be used.

Alternatively, modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts, as disclosed in U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439.

Other oligonucleotides which can be used according to the present invention are those in which both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for complementation with the appropriate polynucleotide target. An example of such an oligonucleotide mimetic is peptide nucleic acid (PNA). A PNA oligonucleotide refers to an oligonucleotide where the sugar-backbone is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The bases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Other backbone modifications which can be used in the present invention are disclosed in U.S. Pat. No. 6,303,374.

Oligonucleotides of the present invention may also include base modifications or substitutions. As used herein, “unmodified” or “natural” bases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified bases include but are not limited to other synthetic and natural bases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further bases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science and Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., ed., CRC Press, 1993. Such bases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. [Sanghvi Y S et al. (1993) Antisense Research and Applications, CRC Press, Boca Raton 276-278] and are presently preferred base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.

Another modification of the oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates, which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. Such moieties include but are not limited to lipid moieties such as a cholesterol moiety, cholic acid, a thioether, e.g., hexyl-S-tritylthiol, a thiocholesterol, an aliphatic chain, e.g., dodecandiol or undecyl residues, a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, or adamantane acetic acid, a palmityl moiety, or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety, as disclosed in U.S. Pat. No. 6,303,374.

It is not necessary for all positions in a given oligonucleotide molecule to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single compound or even at a single nucleoside within an oligonucleotide.

Peptides

The terms “polypeptide,” “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an analog or mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. Polypeptides can be modified, e.g., by the addition of carbohydrate residues to form glycoproteins. The terms “polypeptide,” “peptide” and “protein” include glycoproteins, as well as non-glycoproteins.

Polypeptide products can be biochemically synthesized such as by employing standard solid phase techniques. Such methods include exclusive solid phase synthesis, partial solid phase synthesis methods, fragment condensation, and classical solution synthesis. These methods are preferably used when the peptide is relatively short (i.e., 10 kDa) and/or when it cannot be produced by recombinant techniques (i.e., not encoded by a nucleic acid sequence) and therefore involves different chemistry.
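As a rough, non-limiting illustration of the 10 kDa figure mentioned above, the approximate mass of a candidate peptide may be estimated from its sequence. The names `AVERAGE_RESIDUE_MASS_DA`, `estimate_mass_da` and `is_relatively_short` are hypothetical. The sketch assumes an unmodified, linear peptide composed only of the 20 standard amino acids and uses commonly tabulated average residue masses; it is an estimate only and does not account for glycosylation or other modifications.

```python
# Illustrative sketch only: estimate peptide mass from average residue masses.
# Assumes an unmodified linear peptide of the 20 standard amino acids.

AVERAGE_RESIDUE_MASS_DA = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER_MASS_DA = 18.015  # one water is added for the free N- and C-termini


def estimate_mass_da(sequence: str) -> float:
    """Approximate average molecular mass, in daltons, of a linear peptide."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(AVERAGE_RESIDUE_MASS_DA[res] for res in sequence) + WATER_MASS_DA


def is_relatively_short(sequence: str, limit_da: float = 10_000.0) -> bool:
    """Return True if the estimated mass does not exceed the stated limit."""
    return estimate_mass_da(sequence) <= limit_da


# Residues 21 to 36 of R31375_P0 as recited herein (SEQ ID NO: 149).
fxyd3_fragment = "NDLEDKNSPFYYDWHS"
fragment_mass = estimate_mass_da(fxyd3_fragment)
fragment_is_short = is_relatively_short(fxyd3_fragment)
```

A 16-residue fragment of this kind comes to roughly 2 kDa, well below the 10 kDa figure; sequences of several hundred residues would exceed it.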

Solid phase polypeptide synthesis procedures are well known in the art and further described by John Morrow Stewart and Janis Dillaha Young, Solid Phase Peptide Syntheses (2nd Ed., Pierce Chemical Company, 1984).

Synthetic polypeptides can be purified by preparative high performance liquid chromatography [Creighton T. (1983) Proteins, structures and molecular principles. WH Freeman and Co. N.Y.], and their composition can be confirmed via amino acid sequencing.

In cases where large amounts of a polypeptide are desired, it can be generated using recombinant techniques such as described by Bitter et al. (1987) Methods in Enzymol. 153:516-544; Studier et al. (1990) Methods in Enzymol. 185:60-89; Brisson et al. (1984) Nature 310:511-514; Takamatsu et al. (1987) EMBO J. 6:307-311; Coruzzi et al. (1984) EMBO J. 3:1671-1680; Broglie et al. (1984) Science 224:838-843; Gurley et al. (1986) Mol. Cell. Biol. 6:559-565; and Weissbach & Weissbach, 1988, Methods for Plant Molecular Biology, Academic Press, NY, Section VIII, pp 421-463.

It will be appreciated that peptides identified according to the teachings of the present invention may be degradation products, synthetic peptides or recombinant peptides as well as peptidomimetics, typically, synthetic peptides and peptoids and semipeptoids which are peptide analogs, which may have, for example, modifications rendering the peptides more stable while in a body or more capable of penetrating into cells. Such modifications include, but are not limited to N terminus modification, C terminus modification, peptide bond modification, including, but not limited to, CH2-NH, CH2-S, CH2-S═O, O═C—NH, CH2-O, CH2-CH2, S═C—NH, CH═CH or CF═CH, backbone modifications, and residue modification. Methods for preparing peptidomimetic compounds are well known in the art and are specified, for example, in Quantitative Drug Design, C. A. Ramsden Gd., Chapter 17.2, F. Choplin Pergamon Press (1992), which is incorporated by reference as if fully set forth herein. Further details in this respect are provided hereinunder.

Peptide bonds (—CO—NH—) within the peptide may be substituted, for example, by N-methylated bonds (—N(CH3)-CO—), ester bonds (—C(R)H—C—O—O—C(R)—N—), ketomethylene bonds (—CO—CH2-), α-aza bonds (—NH—N(R)—CO—), wherein R is any alkyl, e.g., methyl, carba bonds (—CH2-NH—), hydroxyethylene bonds (—CH(OH)—CH2-), thioamide bonds (—CS—NH—), olefinic double bonds (—CH═CH—), retro amide bonds (—NH—CO—), or peptide derivatives (—N(R)—CH2-CO—), wherein R is the “normal” side chain, naturally present on the carbon atom.

These modifications can occur at any of the bonds along the peptide chain and even at several (2-3) at the same time.

Natural aromatic amino acids, Trp, Tyr and Phe, may be substituted by synthetic non-natural amino acids such as phenylglycine, TIC, naphthylalanine (Nal), ring-methylated derivatives of Phe, halogenated derivatives of Phe or o-methyl-Tyr.

In addition to the above, the peptides of the present invention may also include one or more modified amino acids or one or more non-amino acid monomers (e.g. fatty acids, complex carbohydrates etc).

As used herein in the specification and in the claims section below the term “amino acid” or “amino acids” is understood to include the 20 naturally occurring amino acids; those amino acids often modified post-translationally in vivo, including, for example, hydroxyproline, phosphoserine and phosphothreonine; and other unusual amino acids including, but not limited to, 2-aminoadipic acid, hydroxylysine, isodesmosine, nor-valine, nor-leucine and ornithine. Furthermore, the term “amino acid” includes both D- and L-amino acids.

Since the peptides of the present invention are preferably utilized in therapeutics which require the peptides to be in soluble form, the peptides of the present invention preferably include one or more non-natural or natural polar amino acids, including but not limited to serine and threonine which are capable of increasing peptide solubility due to their hydroxyl-containing side chain.

The peptides of the present invention are preferably utilized in a linear form, although it will be appreciated that in cases where cyclization does not severely interfere with peptide characteristics, cyclic forms of the peptide can also be utilized.

The peptides of the present invention can be biochemically synthesized such as by using standard solid phase techniques. These methods include exclusive solid phase synthesis, partial solid phase synthesis methods, fragment condensation, and classical solution synthesis. These methods are preferably used when the peptide is relatively short (i.e., 10 kDa) and/or when it cannot be produced by recombinant techniques (i.e., not encoded by a nucleic acid sequence) and therefore involves different chemistry.

Solid phase peptide synthesis procedures are well known in the art and further described by John Morrow Stewart and Janis Dillaha Young, Solid Phase Peptide Syntheses (2nd Ed., Pierce Chemical Company, 1984).

Synthetic peptides can be purified by preparative high performance liquid chromatography [Creighton T. (1983) Proteins, structures and molecular principles. WH Freeman and Co. N.Y.], and their composition can be confirmed via amino acid sequencing.

In cases where large amounts of the peptides of the present invention are desired, the peptides of the present invention can be generated using recombinant techniques such as described by Bitter et al., (1987) Methods in Enzymol. 153:516-544, Studier et al. (1990) Methods in Enzymol. 185:60-89, Brisson et al. (1984) Nature 310:511-514, Takamatsu et al. (1987) EMBO J. 6:307-311, Coruzzi et al. (1984) EMBO J. 3:1671-1680 and Brogli et al., (1984) Science 224:838-843, Gurley et al. (1986) Mol. Cell. Biol. 6:559-565 and Weissbach & Weissbach, 1988, Methods for Plant Molecular Biology, Academic Press, NY, Section VIII, pp 421-463.

Expression Systems

To enable cellular expression of the polynucleotides of the present invention, a nucleic acid construct according to the present invention may be used, which includes at least a coding region of one of the above nucleic acid sequences, and further includes at least one cis acting regulatory element. As used herein, the phrase “cis acting regulatory element” refers to a polynucleotide sequence, preferably a promoter, which binds a trans acting regulator and regulates the transcription of a coding sequence located downstream thereto.

Any suitable promoter sequence can be used by the nucleic acid construct of the present invention.

Preferably, the promoter utilized by the nucleic acid construct of the present invention is active in the specific cell population transformed. Examples of cell type-specific and/or tissue-specific promoters include promoters such as albumin that is liver specific [Pinkert et al., (1987) Genes Dev. 1:268-277], lymphoid specific promoters [Calame et al., (1988) Adv. Immunol. 43:235-275], in particular promoters of T-cell receptors [Winoto et al., (1989) EMBO J. 8:729-733] and immunoglobulins [Banerji et al. (1983) Cell 33:729-740], neuron-specific promoters such as the neurofilament promoter [Byrne et al. (1989) Proc. Natl. Acad. Sci. USA 86:5473-5477], pancreas-specific promoters [Edlund et al. (1985) Science 230:912-916] or mammary gland-specific promoters such as the milk whey promoter (U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). The nucleic acid construct of the present invention can further include an enhancer, which can be adjacent or distant to the promoter sequence and can function in up regulating the transcription therefrom.

The nucleic acid construct of the present invention preferably further includes an appropriate selectable marker and/or an origin of replication. Preferably, the nucleic acid construct utilized is a shuttle vector, which can propagate both in E. coli (wherein the construct comprises an appropriate selectable marker and origin of replication) and be compatible for propagation in cells, or integration in a gene and a tissue of choice. The construct according to the present invention can be, for example, a plasmid, a bacmid, a phagemid, a cosmid, a phage, a virus or an artificial chromosome.

Examples of suitable constructs include, but are not limited to, pcDNA3, pcDNA3.1 (+/−), pGL3, pZeoSV2 (+/−), pDisplay, pEF/myc/cyto, pCMV/myc/cyto each of which is commercially available from Invitrogen Co. (www.invitrogen.com). Examples of retroviral vector and packaging systems are those sold by Clontech, San Diego, Calif., including Retro-X vectors pLNCX and pLXSN, which permit cloning into multiple cloning sites and in which the transgene is transcribed from the CMV promoter. Vectors derived from Mo-MuLV are also included such as pBabe, where the transgene will be transcribed from the 5′LTR promoter.

Currently preferred in vivo nucleic acid transfer techniques include transfection with viral or non-viral constructs, such as adenovirus, lentivirus, Herpes simplex I virus, or adeno-associated virus (AAV) and lipid-based systems. Useful lipids for lipid-mediated transfer of the gene are, for example, DOTMA, DOPE, and DC-Chol [Tonkinson et al., Cancer Investigation, 14(1): 54-65 (1996)]. The most preferred constructs for use in gene therapy are viruses, most preferably adenoviruses, AAV, lentiviruses, or retroviruses. A viral construct such as a retroviral construct includes at least one transcriptional promoter/enhancer or locus-defining elements, or other elements that control gene expression by other means such as alternate splicing, nuclear RNA export, or post-translational modification of messenger. Such vector constructs also include a packaging signal, long terminal repeats (LTRs) or portions thereof, and positive and negative strand primer binding sites appropriate to the virus used, unless it is already present in the viral construct. In addition, such a construct typically includes a signal sequence for secretion of the peptide from a host cell in which it is placed. Preferably the signal sequence for this purpose is a mammalian signal sequence or the signal sequence of the polypeptides of the present invention. Optionally, the construct may also include a signal that directs polyadenylation, as well as one or more restriction sites and a translation termination sequence. By way of example, such constructs will typically include a 5′ LTR, a tRNA binding site, a packaging signal, an origin of second-strand DNA synthesis, and a 3′ LTR or a portion thereof. Other vectors can be used that are non-viral, such as cationic lipids, polylysine, and dendrimers.

Recombinant Expression Vectors and Host Cells

Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid encoding a protein of the invention, or derivatives, fragments, analogs or homologs thereof. As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a “plasmid”, which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors”. In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, “plasmid” and “vector” can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, that are operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operatively-linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequences in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).

The term “regulatory sequence” is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein.

The recombinant expression vectors of the invention can be designed for production of variant proteins in prokaryotic or eukaryotic cells. For example, proteins of the invention can be expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression vectors), yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, at the amino or carboxyl terminus of the recombinant protein. Such fusion vectors typically serve three purposes: (i) to increase expression of recombinant protein; (ii) to increase the solubility of the recombinant protein; and (iii) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Enzymes used for such cleavage, each having a cognate recognition sequence, include Factor Xa, thrombin, PreScission, TEV and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein.
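By way of non-limiting illustration only, the separation of a recombinant protein from its fusion moiety at a proteolytic cleavage site can be sketched in Python as follows. The recognition motifs listed are commonly cited consensus sequences for the named enzymes and are supplied here as assumptions, since the specification does not recite them; the fusion sequence and the function name split_fusion are hypothetical.

```python
import re

# Commonly cited recognition motifs in one-letter amino acid code (assumed,
# not taken from the specification). The integer is the number of motif
# residues that remain on the N-terminal side of the cleaved bond.
CLEAVAGE_SITES = {
    "Factor Xa": ("IEGR", 4),        # cleaves after IEGR
    "thrombin": ("LVPRGS", 4),       # cleaves between R and G
    "PreScission": ("LEVLFQGP", 6),  # cleaves between Q and G
    "TEV": ("ENLYFQG", 6),           # cleaves between Q and G
    "enterokinase": ("DDDDK", 5),    # cleaves after DDDDK
}


def split_fusion(sequence: str, protease: str) -> list[str]:
    """Return the fragments obtained by cutting at every motif occurrence."""
    motif, offset = CLEAVAGE_SITES[protease]
    # A lookahead pattern finds every occurrence, including overlapping ones.
    pattern = f"(?={re.escape(motif)})"
    cuts = [m.start() + offset for m in re.finditer(pattern, sequence)]
    bounds = [0, *cuts, len(sequence)]
    return [sequence[a:b] for a, b in zip(bounds, bounds[1:])]


# Hypothetical fusion: fusion moiety, thrombin site, recombinant protein.
fusion = "MTAGTAG" + "LVPRGS" + "MKTARGET"
print(split_fusion(fusion, "thrombin"))  # ['MTAGTAGLVPR', 'GSMKTARGET']
```

In this sketch the residues of the motif that lie downstream of the cleaved bond remain attached to the recombinant protein, which is one consideration when choosing among the enzymes listed above.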

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 60-89), although the description of pET 11d as a non-fusion vector is not strictly accurate, since the pET11a-d vectors have an N-terminal T7 tag.

One strategy to maximize recombinant protein expression in E. coli is to express the protein in a host bacterium with an impaired capacity to proteolytically cleave the recombinant protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in E. coli (see, e.g., Wada, et al., 1992. Nucl. Acids Res. 20: 2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques. Another strategy to overcome codon bias is to use BL21-codon plus bacterial strains (Invitrogen) or the Rosetta bacterial strain (Novagen); these strains contain extra copies of rare E. coli tRNA genes.
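The codon alteration strategy described above can be illustrated, in a non-limiting manner, by the following Python sketch, which replaces each codon of a coding sequence with the synonymous codon having the highest frequency in a supplied usage table. The function names and the usage values shown are hypothetical placeholders and are not measured E. coli codon frequencies; an actual usage table would be taken from the literature cited above.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code built from the conventional TCAG-ordered table.
GENETIC_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    cds = cds.upper()
    return "".join(GENETIC_CODE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def preferred_codons(usage: dict[str, float]) -> dict[str, str]:
    """Map each amino acid to its most frequently used codon in `usage`."""
    best: dict[str, str] = {}
    for codon, aa in GENETIC_CODE.items():
        if aa not in best or usage.get(codon, 0.0) > usage.get(best[aa], 0.0):
            best[aa] = codon
    return best


def recode(cds: str, usage: dict[str, float]) -> str:
    """Swap every codon for the preferred synonymous codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    best = preferred_codons(usage)
    return "".join(
        best[GENETIC_CODE[cds[i:i + 3]]] for i in range(0, len(cds), 3)
    )


# Placeholder frequencies for illustration only (not real usage data).
toy_usage = {"CTG": 0.5, "CTA": 0.04, "AAA": 0.7, "AAG": 0.3}
original = "ATGCTAAAGTAA"
optimized = recode(original, toy_usage)
print(optimized)                                    # ATGCTGAAATAA
print(translate(original) == translate(optimized))  # True
```

The encoded amino acid sequence is unchanged by the recoding; only the choice among synonymous codons differs.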

In another embodiment, the expression vector encoding for the protein of the invention is a yeast expression vector. Examples of vectors for expression in yeast Saccharomyces cerevisiae include pYepSec1 (Baldari, et al., 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 933-943), pJRY88 (Schultz et al., 1987. Gene 54: 113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ (Invitrogen Corp, San Diego, Calif.).

Alternatively, polypeptides of the present invention can be produced in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 cells) include the pAc series (Smith, et al., 1983. Mol. Cell. Biol. 3: 2156-2165) and the pVL series (Luckow and Summers, 1989. Virology 170: 31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, 1987. Nature 329: 840), pMT2PC (Kaufman, et al., 1987. EMBO J. 6: 187-195), pIRESpuro (Clontech), pUB6 (Invitrogen), pCEP4 (Invitrogen), pREP4 (Invitrogen) and pcDNA3 (Invitrogen). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus, Rous Sarcoma Virus, and simian virus 40. For other suitable expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al., Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.

In another embodiment, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et al., 1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 8: 729-733) and immunoglobulins (Banerji, et al., 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477), pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) and the alpha-fetoprotein promoter (Camper and Tilghman, 1989. Genes Dev. 3: 537-546).

The invention further provides a recombinant expression vector comprising a DNA molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to mRNA encoding for protein of the invention. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense orientation can be chosen that direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a high efficiency regulatory region, the activity of which can be determined by the cell type into which the vector is introduced. For a discussion of the regulation of gene expression using antisense genes see, e.g., Weintraub, et al., “Antisense RNA as a molecular tool for genetic analysis,” Reviews-Trends in Genetics, Vol. 1(1) 1986.
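For illustration only, the relationship between a DNA molecule cloned in the sense orientation and the same molecule cloned in the antisense orientation can be sketched in Python as follows. The nine-nucleotide fragment and the function names are hypothetical, and transcription is modeled simply as copying the strand that reads 5′ to 3′ downstream of the regulatory sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA strand written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def transcribe(strand: str) -> str:
    """RNA having the same sequence as the given strand (T replaced by U)."""
    return strand.upper().replace("T", "U")


sense_insert = "ATGGCCAAG"  # hypothetical fragment of a coding sequence
antisense_insert = reverse_complement(sense_insert)

mrna = transcribe(sense_insert)               # AUGGCCAAG
antisense_rna = transcribe(antisense_insert)  # CUUGGCCAU
print(antisense_insert, mrna, antisense_rna)
```

The RNA transcribed from the insert in the antisense orientation is the reverse complement of the mRNA and can therefore base-pair with it along its length.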

Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms “host cell” and “recombinant host cell” are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, protein of the invention can be produced in bacterial cells such as E. coli, insect cells, yeast, plant or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS or 293 cells). Other suitable host cells are known to those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms “transformation” and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Various selectable markers include those that confer resistance to drugs, such as G418, hygromycin, puromycin, blasticidin and methotrexate. Nucleic acids encoding a selectable marker can be introduced into a host cell on the same vector as that encoding protein of the invention or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) protein of the invention. Accordingly, the invention further provides methods for producing proteins of the invention using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of the present invention (into which a recombinant expression vector encoding protein of the invention has been introduced) in a suitable medium such that the protein of the invention is produced. In another embodiment, the method further comprises isolating protein of the invention from the medium or the host cell.

For efficient production of the protein, it is preferable to place the nucleotide sequences encoding the protein of the invention under the control of expression control sequences optimized for expression in a desired host. For example, the sequences may include optimized transcriptional and/or translational regulatory sequences (such as altered Kozak sequences).

Protein Modifications

Fusion Proteins

According to the present invention, a fusion protein may be prepared from a protein of the invention by fusion with a portion of an immunoglobulin comprising a constant region of an immunoglobulin. More preferably, the portion of the immunoglobulin comprises a heavy chain constant region which is optionally and more preferably a human heavy chain constant region. The heavy chain constant region is most preferably an IgG heavy chain constant region, and optionally and most preferably is an Fc chain, most preferably an IgG Fc fragment that comprises CH2 and CH3 domains. Although any IgG subtype may optionally be used, the IgG1 subtype is preferred. The Fc chain may optionally be a known or “wild type” Fc chain, or alternatively may be mutated. Non-limiting, illustrative, exemplary types of mutations are described in US Patent Application No. 20060034852, published on Feb. 16, 2006, hereby incorporated by reference as if fully set forth herein. The term “Fc chain” also optionally comprises any type of Fc fragment.

Several of the specific amino acid residues that are important for antibody constant region-mediated activity in the IgG subclass have been identified. Inclusion, substitution or exclusion of these specific amino acids therefore allows for inclusion or exclusion of specific immunoglobulin constant region-mediated activity. Furthermore, specific changes may result in aglycosylation for example and/or other desired changes to the Fc chain. At least some changes may optionally be made to block a function of Fc which is considered to be undesirable, such as an undesirable immune system effect, as described in greater detail below.

Non-limiting, illustrative examples of mutations to Fc which may be made to modulate the activity of the fusion protein include the following changes (given with regard to the Fc sequence nomenclature as given by Kabat, from Kabat E A et al: Sequences of Proteins of Immunological Interest. US Department of Health and Human Services, NIH, 1991): 220C→S; 233-238 ELLGGP→EAEGAP; 265D→A, preferably in combination with 434N→A; 297N→A (for example to block N-glycosylation); 318-322 EYKCK→AYACA; 330-331AP→SS; or a combination thereof (see for example M. Clark, “Chemical Immunol and Antibody Engineering”, pp 1-31 for a description of these mutations and their effect). The construct for the Fc chain which features the above changes optionally and preferably comprises a combination of the hinge region with the CH2 and CH3 domains.
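Purely as a non-limiting sketch, the above changes can be represented as position-indexed substitutions and applied to a numbered set of residues, as in the following Python example. The dictionary keyed by residue number, the function names and the six-residue fragment are hypothetical; only the residue blocks quoted above (for example 233-238 ELLGGP→EAEGAP) are taken from the text, and no complete Fc sequence is assumed.

```python
def block_change(start: int, old: str, new: str) -> dict[int, tuple[str, str]]:
    """Expand a block such as 233-238 ELLGGP→EAEGAP into point changes."""
    if len(old) != len(new):
        raise ValueError("old and new blocks must have the same length")
    return {
        start + i: (o, n)
        for i, (o, n) in enumerate(zip(old, new))
        if o != n
    }


def apply_substitutions(
    residues: dict[int, str],
    changes: dict[int, tuple[str, str]],
) -> dict[int, str]:
    """Return a copy of `residues` with each (expected, replacement) applied.

    A ValueError is raised if the residue found at a position differs from
    the expected one, which guards against a numbering mismatch.
    """
    mutated = dict(residues)
    for position, (expected, replacement) in changes.items():
        found = mutated.get(position)
        if found != expected:
            raise ValueError(
                f"position {position}: expected {expected}, found {found}"
            )
        mutated[position] = replacement
    return mutated


# Hypothetical fragment covering only positions 233-238.
fragment = {233 + i: aa for i, aa in enumerate("ELLGGP")}
changes = block_change(233, "ELLGGP", "EAEGAP")
mutated = apply_substitutions(fragment, changes)
print("".join(mutated[p] for p in sorted(mutated)))  # EAEGAP
```

Checking the expected residue before each substitution is a design choice made here so that a sequence numbered under a different convention is rejected rather than silently altered.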

The above mutations may optionally be implemented to enhance desired properties or alternatively to block non-desired properties. For example, aglycosylation of antibodies was shown to maintain the desired binding functionality while blocking depletion of T-cells or triggering cytokine release, which may optionally be undesired functions (see M. Clark, “Chemical Immunol and Antibody Engineering”, pp 1-31). Substitution of 331proline for serine may block the ability to activate complement, which may optionally be considered an undesired function (see M. Clark, “Chemical Immunol and Antibody Engineering”, pp 1-31). Changing 330alanine to serine in combination with this change may also enhance the desired effect of blocking the ability to activate complement.

Residues 235 and 237 were shown to be involved in antibody-dependent cell-mediated cytotoxicity (ADCC), such that changing the block of residues from 233-238 as described may also block such activity if ADCC is considered to be an undesirable function.

Residue 220 is normally a cysteine for Fc from IgG1, which is the site at which the heavy chain forms a covalent linkage with the light chain. Optionally, this residue may be changed to a serine, to avoid any type of covalent linkage (see M. Clark, “Chemical Immunol and Antibody Engineering”, pp 1-31).

The above changes to residues 265 and 434 may optionally be implemented to reduce or block binding to the Fc receptor, which may optionally block undesired functionality of Fc related to its immune system functions (see “Binding site on Human IgG1 for Fc Receptors”, Shields et al, Vol 276, pp 6591-6604, 2001).

The above changes are intended as illustrations only of optional changes and are not meant to be limiting in any way. Furthermore, the above explanation is provided for descriptive purposes only, without wishing to be bound by a single hypothesis.

Addition of Groups

If a protein according to the present invention is a linear molecule, it is possible to place various functional groups at various points on the linear molecule which are susceptible to or suitable for chemical modification. Functional groups can be added to the termini of linear forms of the protein of the invention. In some embodiments, the functional groups improve the activity of the protein with regard to one or more characteristics, including but not limited to, improvement in stability, penetration (through cellular membranes and/or tissue barriers), tissue localization, efficacy, decreased clearance, decreased toxicity, improved selectivity, improved resistance to expulsion by cellular pumps, and the like. For convenience sake and without wishing to be limiting, the free N-terminus of one of the sequences contained in the compositions of the invention will be termed as the N-terminus of the composition, and the free C-terminus of the sequence will be considered as the C-terminus of the composition. Either the C-terminus or the N-terminus of the sequences, or both, can be linked to a carboxylic acid functional group or an amine functional group, respectively.

Non-limiting examples of suitable functional groups are described in Greene and Wuts, “Protecting Groups in Organic Synthesis”, John Wiley and Sons, Chapters 5 and 7, 1991, the teachings of which are incorporated herein by reference. Preferred protecting groups are those that facilitate transport of the active ingredient attached thereto into a cell, for example, by reducing the hydrophilicity and increasing the lipophilicity of the active ingredient, these being an example for “a moiety for transport across cellular membranes”.

These moieties can optionally and preferably be cleaved in vivo, either by hydrolysis or enzymatically, inside the cell (Dittert et al., J. Pharm. Sci. 57:783 (1968); Dittert et al., J. Pharm. Sci. 57:828 (1968); Dittert et al., J. Pharm. Sci. 58:557 (1969); King et al., Biochemistry 26:2294 (1987); Lindberg et al., Drug Metabolism and Disposition 17:311 (1989); Tunek et al., Biochem. Pharm. 37:3867 (1988); Anderson et al., Arch. Biochem. Biophys. 239:538 (1985); and Singhal et al., FASEB J. 1:220 (1987)). Hydroxyl protecting groups include esters, carbonates and carbamate protecting groups. Amine protecting groups include alkoxy and aryloxy carbonyl groups, as described below for N-terminal protecting groups. Carboxylic acid protecting groups include aliphatic, benzylic and aryl esters, as described below for C-terminal protecting groups. In one embodiment, the carboxylic acid group in the side chain of one or more glutamic acid or aspartic acid residues in a composition of the present invention is protected, preferably with a methyl, ethyl, benzyl or substituted benzyl ester, more preferably as a benzyl ester.

Non-limiting, illustrative examples of N-terminal protecting groups include acyl groups (—CO—R1) and alkoxy carbonyl or aryloxy carbonyl groups (—CO—O—R1), wherein R1 is an aliphatic, substituted aliphatic, benzyl, substituted benzyl, aromatic or a substituted aromatic group. Specific examples of acyl groups include but are not limited to acetyl, (ethyl)-CO—, n-propyl-CO—, iso-propyl-CO—, n-butyl-CO—, sec-butyl-CO—, t-butyl-CO—, hexyl, lauroyl, palmitoyl, myristoyl, stearyl, oleoyl, phenyl-CO—, substituted phenyl-CO—, benzyl-CO— and (substituted benzyl)-CO—. Examples of alkoxy carbonyl and aryloxy carbonyl groups include CH3-O—CO—, (ethyl)-O—CO—, n-propyl-O—CO—, iso-propyl-O—CO—, n-butyl-O—CO—, sec-butyl-O—CO—, t-butyl-O—CO—, phenyl-O—CO—, substituted phenyl-O—CO—, benzyl-O—CO—, (substituted benzyl)-O—CO—, adamantane, naphthalene, myristoleyl, toluene, biphenyl, cinnamoyl, nitrobenzoyl, toluoyl, furoyl, benzoyl, cyclohexane, norbornane, or Z-caproic. In order to facilitate the N-acylation, one to four glycine residues can be present in the N-terminus of the molecule.

The carboxyl group at the C-terminus of the compound can be protected, for example, by a group including but not limited to an amide (i.e., the hydroxyl group at the C-terminus is replaced with —NH₂, —NHR₂ or —NR₂R₃) or ester (i.e., the hydroxyl group at the C-terminus is replaced with —OR₂). R₂ and R₃ are optionally independently an aliphatic, substituted aliphatic, benzyl, substituted benzyl, aryl or a substituted aryl group. In addition, taken together with the nitrogen atom, R₂ and R₃ can optionally form a C4 to C8 heterocyclic ring with from about 0-2 additional heteroatoms such as nitrogen, oxygen or sulfur. Non-limiting examples of suitable heterocyclic rings include piperidinyl, pyrrolidinyl, morpholino, thiomorpholino or piperazinyl. Examples of C-terminal protecting groups include but are not limited to —NH₂, —NHCH₃, —N(CH₃)₂, —NH (ethyl), —N(ethyl)₂, —N(methyl) (ethyl), —NH (benzyl), —N(C1-C4 alkyl)(benzyl), —NH(phenyl), —N(C1-C4 alkyl) (phenyl), —OCH₃, —O-(ethyl), —O-(n-propyl), —O-(n-butyl), —O-(iso-propyl), —O-(sec-butyl), —O-(t-butyl), —O-benzyl and —O-phenyl.

Substitution by Peptidomimetic Moieties

A “peptidomimetic organic moiety” can optionally be substituted for amino acid residues in the composition of this invention both as conservative and as non-conservative substitutions. These moieties are also termed “non-natural amino acids” and may optionally replace amino acid residues, amino acids or act as spacer groups within the peptides in lieu of deleted amino acids. The peptidomimetic organic moieties optionally and preferably have steric, electronic or configurational properties similar to the replaced amino acid and such peptidomimetics are used to replace amino acids in the essential positions, and are considered conservative substitutions. However such similarities are not necessarily required. According to preferred embodiments of the present invention, one or more peptidomimetics are selected such that the composition at least substantially retains its physiological activity as compared to the native protein according to the present invention.

Peptidomimetics may optionally be used to inhibit degradation of the peptides by enzymatic or other degradative processes. The peptidomimetics can optionally and preferably be produced by organic synthetic techniques. Non-limiting examples of suitable peptidomimetics include D amino acids of the corresponding L amino acids, tetrazole (Zabrocki et al., J. Am. Chem. Soc. 110:5875-5880 (1988)); isosteres of amide bonds (Jones et al., Tetrahedron Lett. 29: 3853-3856 (1988)); LL-3-amino-2-propenidone-6-carboxylic acid (LL-Acp) (Kemp et al., J. Org. Chem. 50:5834-5838 (1985)). Similar analogs are shown in Kemp et al., Tetrahedron Lett. 29:5081-5082 (1988) as well as Kemp et al., Tetrahedron Lett. 29:5057-5060 (1988), Kemp et al., Tetrahedron Lett. 29:4935-4938 (1988) and Kemp et al., J. Org. Chem. 54:109-115 (1987). Other suitable exemplary peptidomimetics are shown in Nagai and Sato, Tetrahedron Lett. 26:647-650 (1985); Di Maio et al., J. Chem. Soc. Perkin Trans., 1687 (1985); Kahn et al., Tetrahedron Lett. 30:2317 (1989); Olson et al., J. Am. Chem. Soc. 112:323-333 (1990); Garvey et al., J. Org. Chem. 56:436 (1990). Further suitable exemplary peptidomimetics include hydroxy-1,2,3,4-tetrahydroisoquinoline-3-carboxylate (Miyake et al., J. Takeda Res. Labs 43:53-76 (1989)); 1,2,3,4-tetrahydro-isoquinoline-3-carboxylate (Kazmierski et al., J. Am. Chem. Soc. 113:2275-2283 (1991)); histidine isoquinolone carboxylic acid (HIC) (Zechel et al., Int. J. Pep. Protein Res. 43 (1991)); (2S, 3S)-methyl-phenylalanine, (2S, 3R)-methyl-phenylalanine, (2R, 3S)-methyl-phenylalanine and (2R, 3R)-methyl-phenylalanine (Kazmierski and Hruby, Tetrahedron Lett. (1991)).

Exemplary, illustrative but non-limiting non-natural amino acids include beta-amino acids (beta3 and beta2), homo-amino acids, cyclic amino acids, aromatic amino acids, Pro and Pyr derivatives, 3-substituted Alanine derivatives, Glycine derivatives, ring-substituted Phe and Tyr Derivatives, linear core amino acids or diamino acids. They are available from a variety of suppliers, such as Sigma-Aldrich (USA) for example.

Chemical Modifications

In the present invention any part of a protein of the invention may optionally be chemically modified, i.e. changed by addition of functional groups. For example the side amino acid residues appearing in the native sequence may optionally be modified, although as described below alternatively other parts of the protein may optionally be modified, in addition to or in place of the side amino acid residues. The modification may optionally be performed during synthesis of the molecule if a chemical synthetic process is followed, for example by adding a chemically modified amino acid. However, chemical modification of an amino acid when it is already present in the molecule (“in situ” modification) is also possible.

Any amino acid in any of the sequence regions of the molecule can optionally be modified according to any one of the following exemplary types of modification (the resulting peptide being conceptually viewed as “chemically modified”). Non-limiting exemplary types of modification include carboxymethylation, acylation, phosphorylation, glycosylation or fatty acylation. Ether bonds can optionally be used to join the serine or threonine hydroxyl to the hydroxyl of a sugar. Amide bonds can optionally be used to join the glutamate or aspartate carboxyl groups to an amino group on a sugar (Garg and Jeanloz, Advances in Carbohydrate Chemistry and Biochemistry, Vol. 43, Academic Press (1985); Kunz, Angew. Chem. Int. Ed. English 26:294-308 (1987)). Acetal and ketal bonds can also optionally be formed between amino acids and carbohydrates. Fatty acid acyl derivatives can optionally be made, for example, by acylation of a free amino group (e.g., lysine) (Toth et al., Peptides: Chemistry, Structure and Biology, Rivier and Marshall, eds., ESCOM Publ., Leiden, 1078-1079 (1990)).

As used herein the term “chemical modification”, when referring to a protein or peptide according to the present invention, refers to a protein or peptide where at least one of its amino acid residues is modified either by natural processes, such as processing or other post-translational modifications, or by chemical modification techniques which are well known in the art. Examples of the numerous known modifications typically include, but are not limited to: acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, pegylation, prenylation, phosphorylation, ubiquitination, or any similar process.

Other types of modifications optionally include the addition of a cycloalkane moiety to a biological molecule, such as a protein, as described in PCT Application No. WO 2006/050262, hereby incorporated by reference as if fully set forth herein. These moieties are designed for use with biomolecules and may optionally be used to impart various properties to proteins.

Furthermore, optionally any point on a protein may be modified. For example, pegylation of a glycosylation moiety on a protein may optionally be performed, as described in PCT Application No. WO 2006/050247, hereby incorporated by reference as if fully set forth herein. One or more polyethylene glycol (PEG) groups may optionally be added to O-linked and/or N-linked glycosylation. The PEG group may optionally be branched or linear. Optionally any type of water-soluble polymer may be attached to a glycosylation site on a protein through a glycosyl linker.

Altered Glycosylation

Proteins of the invention may be modified to have an altered glycosylation pattern (i.e., altered from the original or native glycosylation pattern). As used herein, “altered” means having one or more carbohydrate moieties deleted, and/or having at least one glycosylation site added to the original protein.

Glycosylation of proteins is typically either N-linked or O-linked. N-linked refers to the attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tripeptide sequences, asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, are the recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. Thus, the presence of either of these tripeptide sequences in a polypeptide creates a potential glycosylation site. O-linked glycosylation refers to the attachment of one of the sugars N-acetylgalactosamine, galactose, or xylose to a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5-hydroxylysine may also be used.
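
By way of non-limiting illustration, the N-linked recognition sequences described above (asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline) can be located in a one-letter amino acid sequence with a short script. The following is a minimal sketch only; the function name and the example sequence are hypothetical and are not taken from any sequence of the invention. A match indicates a potential site, not that the site is actually glycosylated.

```python
import re

# N-X-S/T sequon where X is any residue except proline.
# The lookahead keeps matches zero-width after the asparagine,
# so overlapping sequons (e.g. "NNTS") are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_n_glycosylation_sites(sequence):
    """Return 1-based positions of asparagines that start an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    toy_sequence = "MNGSANPTQNASNKT"  # hypothetical sequence for illustration
    print(find_n_glycosylation_sites(toy_sequence))
```

In the toy sequence the asparagine followed by proline is skipped, consistent with the exclusion of proline at position X.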

Addition of glycosylation sites to proteins of the invention is conveniently accomplished by altering the amino acid sequence of the protein such that it contains one or more of the above-described tripeptide sequences (for N-linked glycosylation sites). The alteration may also be made by the addition of, or substitution by, one or more serine or threonine residues in the sequence of the original protein (for O-linked glycosylation sites). The protein's amino acid sequence may also be altered by introducing changes at the DNA level.

Another means of increasing the number of carbohydrate moieties on proteins is by chemical or enzymatic coupling of glycosides to the amino acid residues of the protein. Depending on the coupling mode used, the sugars may be attached to (a) arginine and histidine, (b) free carboxyl groups, (c) free sulfhydryl groups such as those of cysteine, (d) free hydroxyl groups such as those of serine, threonine, or hydroxyproline, (e) aromatic residues such as those of phenylalanine, tyrosine, or tryptophan, or (f) the amide group of glutamine. These methods are described in WO 87/05330, and in Aplin and Wriston, CRC Crit. Rev. Biochem., 22: 259-306 (1981).

Removal of any carbohydrate moieties present on proteins of the invention may be accomplished chemically or enzymatically. Chemical deglycosylation requires exposure of the protein to trifluoromethanesulfonic acid, or an equivalent compound. This treatment results in the cleavage of most or all sugars except the linking sugar (N-acetylglucosamine or N-acetylgalactosamine), leaving the amino acid sequence intact.

Chemical deglycosylation is described by Hakimuddin et al., Arch. Biochem. Biophys., 259: 52 (1987); and Edge et al., Anal. Biochem., 118: 131 (1981). Enzymatic cleavage of carbohydrate moieties on proteins can be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzymol., 138: 350 (1987).

Methods of Treatment

As mentioned hereinabove, the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and polypeptides of the present invention, or nucleic acid sequences or fragments thereof, especially the ectodomain or secreted forms of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins, as well as drugs which specifically bind to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants, and/or drugs which agonize or antagonize the binding of other moieties to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants, and/or drugs which modulate (agonize or antagonize) at least one VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 related biological activity (such drugs include by way of example antibodies, small molecules, peptides, ribozymes, antisense molecules, siRNAs and the like), can be used to treat cancer, including but not limited to non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, and cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas and brain, wherein the cancer may be non-metastatic, invasive or metastatic.

The VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and polypeptides of the present invention, or nucleic acid sequences or fragments thereof, especially the ectodomain or secreted forms of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins, as well as drugs which specifically bind to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants, and/or drugs which agonize or antagonize the binding of other moieties to the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 proteins and/or splice variants, and/or drugs which modulate (agonize or antagonize) at least one VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 related biological activity (such drugs include by way of example antibodies, small molecules, peptides, ribozymes, antisense molecules, siRNAs and the like), can be further used to treat non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease, and/or for blocking or promoting immune costimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 polypeptide.

Thus, according to an additional aspect of the present invention there is provided a method of treating cancer, including but not limited to non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease, and/or for blocking or promoting immune costimulation mediated by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 polypeptide in a subject.

The subject according to the present invention is a mammal, preferably a human, which is diagnosed with one of the diseases, disorders or conditions described hereinabove, or alternatively is predisposed to at least one type of cancer, including but not limited to non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, and cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas and brain, as well as non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease.

As used herein the term “treating” refers to preventing, curing, reversing, attenuating, alleviating, minimizing, suppressing or halting the deleterious effects of the above-described diseases, disorders or conditions.

Treating, according to the present invention, can be effected by specifically upregulating the expression of at least one of the polypeptides of the present invention in the subject.

Optionally, upregulation may be effected by administering to the subject at least one of the polypeptides of the present invention (e.g., recombinant or synthetic) or an active portion thereof, as described herein. However, since the bioavailability of large polypeptides may potentially be relatively small due to high degradation rate and low penetration rate, administration of polypeptides is preferably confined to small peptide fragments (e.g., about 100 amino acids). The polypeptide or peptide may optionally be administered as part of a pharmaceutical composition, described in more detail below.

It will be appreciated that treatment of the above-described diseases according to the present invention may be combined with other treatment methods known in the art (i.e., combination therapy). Thus, treatment of malignancies using the agents of the present invention may be combined with, for example, radiation therapy, antibody therapy and/or chemotherapy.

Alternatively or additionally, an upregulating method may optionally be effected by specifically upregulating the amount (optionally expression) in the subject of at least one of the polypeptides of the present invention or active portions thereof.

As is mentioned hereinabove and in the Examples section which follows, the biomolecular sequences of this aspect of the present invention may be used as valuable therapeutic tools in the treatment of diseases, disorders or conditions in which altered activity or expression of the wild-type gene product (known protein) is known to contribute to disease, disorder or condition onset or progression. For example, in case a disease is caused by overexpression of a membrane bound-receptor, a soluble variant thereof may be used as an antagonist which competes with the receptor for binding the ligand, to thereby terminate signaling from the receptor.

Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32, Anti-FXYD3 Antibodies

The antibodies of the invention, including those having the particular germline sequences, homologous antibodies, antibodies with conservative modifications, and engineered and modified antibodies, are characterized by particular functional features or properties of the antibodies. For example, the antibodies bind specifically to human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3. Preferably, an antibody of the invention binds to corresponding VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 with high affinity, for example with a KD of 10⁻⁸ M or less or 10⁻⁹ M or less or even 10⁻¹⁰ M or less. The anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies of the invention preferably exhibit one or more of the following characteristics:

(i) binds to corresponding human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 with a KD of 5×10⁻⁸ M or less;

(ii) modulates (enhances or inhibits) B7 immune costimulation and related activities and functions such as T cell responses involved in antitumor immunity and autoimmunity; and/or

(iii) binds to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen expressed by cancer cells including for example lung cancer, ovarian cancer, colon cancer, but does not substantially bind to normal cells. In addition, preferably these antibodies and conjugates thereof will be effective in eliciting selective killing of such cancer cells and for modulating immune responses involved in autoimmunity and cancer.

More preferably, the antibody binds to corresponding human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen with a KD of 3×10⁻⁸ M or less, or with a KD of 1×10⁻⁹ M or less, or with a KD of 0.1×10⁻⁹ M or less, or with a KD of 0.05×10⁻⁹ M or less or with a KD of between 1×10⁻⁹ and 1×10⁻¹¹ M.

Standard assays to evaluate the binding ability of the antibodies toward VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 are known in the art, including for example, ELISAs, Western blots and RIAs. Suitable assays are described in detail in the Examples. The binding kinetics (e.g., binding affinity) of the antibodies also can be assessed by standard assays known in the art, such as by Biacore analysis.
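
For orientation only, the equilibrium dissociation constant obtained from kinetic measurements is conventionally the ratio of the dissociation rate constant to the association rate constant (KD = k_off / k_on). The sketch below shows that arithmetic and a comparison against an affinity threshold such as the 10⁻⁸ M value mentioned above. The function names and the rate constants are hypothetical illustrations and are not measured values for any antibody of the invention.

```python
def dissociation_constant(k_on, k_off):
    """Return KD in M from k_on (1/(M*s)) and k_off (1/s)."""
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on


def meets_affinity_threshold(kd, threshold=1e-8):
    """True if the KD is at or below the threshold (lower KD = higher affinity)."""
    return kd <= threshold


if __name__ == "__main__":
    # Hypothetical rate constants, for illustration only.
    kd = dissociation_constant(k_on=1e5, k_off=1e-4)
    print(f"KD = {kd:.1e} M, high affinity: {meets_affinity_threshold(kd)}")
```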

Once anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody sequences have been obtained from antibodies that can bind to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, the VH and VL sequences can be “mixed and matched” to create other anti-VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 binding molecules of the invention. VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 binding of such “mixed and matched” antibodies can be tested using the binding assays described above (e.g., ELISAs). Preferably, when VH and VL chains are mixed and matched, a VH sequence from a particular VH/VL pairing is replaced with a structurally similar VH sequence. Likewise, preferably a VL sequence from a particular VH/VL pairing is replaced with a structurally similar VL sequence. For example, the VH and VL sequences of homologous antibodies are particularly amenable for mixing and matching.

Antibodies Having Particular Germline Sequences

In certain embodiments, an antibody of the invention comprises a heavy chain variable region from a particular germline heavy chain immunoglobulin gene and/or a light chain variable region from a particular germline light chain immunoglobulin gene.

As used herein, a human antibody comprises heavy or light chain variable regions that are “the product of” or “derived from” a particular germline sequence if the variable regions of the antibody are obtained from a system that uses human germline immunoglobulin genes. Such systems include immunizing a transgenic mouse carrying human immunoglobulin genes with the antigen of interest or screening a human immunoglobulin gene library displayed on phage with the antigen of interest. A human antibody that is “the product of” or “derived from” a human germline immunoglobulin sequence can be identified as such by comparing the amino acid sequence of the human antibody to the amino acid sequences of human germline immunoglobulins and selecting the human germline immunoglobulin sequence that is closest in sequence (i.e., greatest % identity) to the sequence of the human antibody.

A human antibody that is “the product of” or “derived from” a particular human germline immunoglobulin sequence may contain amino acid differences as compared to the germline sequence, due to, for example, naturally-occurring somatic mutations or intentional introduction of site-directed mutation. However, a selected human antibody typically is at least 90% identical in amino acid sequence to an amino acid sequence encoded by a human germline immunoglobulin gene and contains amino acid residues that identify the human antibody as being human when compared to the germline immunoglobulin amino acid sequences of other species (e.g., murine germline sequences). In certain cases, a human antibody may be at least 95%, or even at least 96%, 97%, 98%, or 99% identical in amino acid sequence to the amino acid sequence encoded by the germline immunoglobulin gene. Typically, a human antibody derived from a particular human germline sequence will display no more than 10 amino acid differences from the amino acid sequence encoded by the human germline immunoglobulin gene. In certain cases, the human antibody may display no more than 5, or even no more than 4, 3, 2, or 1 amino acid difference from the amino acid sequence encoded by the germline immunoglobulin gene.
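
As a non-limiting illustration of selecting the germline sequence that is closest in sequence, the following sketch compares an antibody variable region fragment against a small set of candidate germline sequences and reports the best match with its percent identity and number of amino acid differences. It assumes, for simplicity, that all sequences are already aligned and of equal length; the germline names and toy sequences are hypothetical.

```python
def count_differences(seq_a, seq_b):
    """Count mismatched positions between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def closest_germline(antibody, germlines):
    """Return (name, percent_identity, differences) of the most similar germline."""
    best = None
    for name, germline in germlines.items():
        differences = count_differences(antibody, germline)
        identity = 100.0 * (len(germline) - differences) / len(germline)
        if best is None or identity > best[1]:
            best = (name, identity, differences)
    return best


if __name__ == "__main__":
    # Hypothetical toy fragments, for illustration only.
    toy_germlines = {"germline_A": "EVQLVESGGG", "germline_B": "QVQLQQSGAE"}
    print(closest_germline("EVQLVQSGGG", toy_germlines))
```

The returned identity and difference count can then be compared against thresholds such as the 90% identity or 10-difference values discussed above.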

Homologous Antibodies

In yet another embodiment, an antibody of the invention comprises heavy and light chain variable regions comprising amino acid sequences that are homologous to isolated anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 amino acid sequences of preferred anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies, respectively, wherein the antibodies retain the desired functional properties of the parent anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies.

As used herein, the percent homology between two amino acid sequences is equivalent to the percent identity between the two sequences. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % homology=# of identical positions/total # of positions×100), taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm, as described in the non-limiting examples below.
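
The percent identity formula above can be sketched as follows for two sequences that have already been aligned, with gaps written as “-”. This is a minimal illustration under one stated assumption: the total number of positions is taken to be the full alignment length, including gap columns. Other conventions for the denominator exist, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity = identical positions / total aligned positions x 100."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    total = len(aligned_a)
    if total == 0:
        raise ValueError("alignment is empty")
    # A column counts as identical only if both residues match and neither is a gap.
    identical = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * identical / total


if __name__ == "__main__":
    # Hypothetical aligned fragments: 7 identical columns out of 9.
    print(round(percent_identity("ACDEFGHIK", "ACDQFG-IK"), 2))
```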

The percent identity between two amino acid sequences can be determined using the algorithm of E. Meyers and W. Miller (Comput. Appl. Biosci., 4:11-17 (1988)) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. In addition, the percent identity between two amino acid sequences can be determined using the Needleman and Wunsch (J. Mol. Biol. 48:444-453 (1970)) algorithm which has been incorporated into the GAP program in the GCG software package (available commercially), using either a BLOSUM62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6.
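
To illustrate the kind of global alignment performed by the Needleman and Wunsch algorithm, a simplified sketch follows. It is not the GAP or ALIGN implementation: it assumes a plain match/mismatch score in place of a substitution matrix and a single linear gap penalty in place of separate gap and length weights. The default scores and the example sequences are hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align a and b; return (aligned_a, aligned_b, score)."""
    n, m = len(a), len(b)

    def sub(i, j):
        # Simplified substitution score (assumption: no substitution matrix).
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the dynamic programming table.
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "AGT"))
```

The two aligned strings returned by this sketch have equal length, so they can be passed directly to a percent identity calculation of the kind defined above.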

Additionally or alternatively, the protein sequences of the present invention can further be used as a “query sequence” to perform a search against public databases to, for example, identify related sequences. Such searches can be performed using the XBLAST program (version 2.0) of Altschul, et al. (1990) J Mol. Biol. 215:403-10. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to the antibody molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used.

Antibodies with Conservative Modifications

In certain embodiments, an antibody of the invention comprises a heavy chain variable region comprising CDR1, CDR2 and CDR3 sequences and a light chain variable region comprising CDR1, CDR2 and CDR3 sequences, wherein one or more of these CDR sequences comprise specified amino acid sequences based on preferred anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies isolated and produced using methods herein, or conservative modifications thereof, and wherein the antibodies retain the desired functional properties of the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies of the invention, respectively.

In various embodiments, the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody can be, for example, human antibodies, humanized antibodies or chimeric antibodies.

As used herein, the term “conservative sequence modifications” is intended to refer to amino acid modifications that do not significantly affect or alter the binding characteristics of the antibody containing the amino acid sequence. Such conservative modifications include amino acid substitutions, additions and deletions. Modifications can be introduced into an antibody of the invention by standard techniques known in the art, such as site-directed mutagenesis and PCR-mediated mutagenesis. Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, tryptophan), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, one or more amino acid residues within the CDR regions of an antibody of the invention can be replaced with other amino acid residues from the same side chain family and the altered antibody can be tested for retained function (i.e., the functions set forth in (i) through (iii) above) using the functional assays described herein.
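
By way of illustration, the side chain families listed above can be encoded as a lookup table to check whether a proposed substitution stays within a family. This is a minimal sketch; the family membership is copied from the preceding paragraph, while the function names are hypothetical. Membership in a shared family only suggests a candidate conservative substitution; retained function still has to be confirmed by assay.

```python
# Side chain families as listed in the text (one-letter amino acid codes).
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged polar": set("GNQSTYCW"),
    "nonpolar": set("AVLIPFM"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def shared_families(original, replacement):
    """Return the sorted names of families that contain both residues."""
    return sorted(
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if original in members and replacement in members
    )


def is_conservative(original, replacement):
    """True if the two residues differ but share at least one side chain family."""
    return original != replacement and bool(shared_families(original, replacement))


if __name__ == "__main__":
    print(is_conservative("K", "R"), shared_families("K", "R"))
    print(is_conservative("D", "K"), shared_families("D", "K"))
```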

Antibodies that Bind to the Same Epitope as Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32, or Anti-FXYD3 Antibodies of the Invention

In another embodiment, the invention provides antibodies that bind to preferred epitopes on human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 which possess desired functional properties such as modulation of B7 co-stimulation and related functions. Other antibodies with desired epitope specificity may be selected and will have the ability to cross-compete for binding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen with the desired antibodies.

Engineered and Modified Antibodies

An antibody of the invention further can be prepared using an antibody having one or more of the VH and/or VL sequences derived from an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody starting material to engineer a modified antibody, which modified antibody may have altered properties from the starting antibody. An antibody can be engineered by modifying one or more residues within one or both variable regions (i.e., VH and/or VL), for example within one or more CDR regions and/or within one or more framework regions. Additionally or alternatively, an antibody can be engineered by modifying residues within the constant regions, for example to alter the effector functions of the antibody.

One type of variable region engineering that can be performed is CDR grafting. Antibodies interact with target antigens predominantly through amino acid residues that are located in the six heavy and light chain complementarity determining regions (CDRs). For this reason, the amino acid sequences within CDRs are more diverse between individual antibodies than sequences outside of CDRs. Because CDR sequences are responsible for most antibody-antigen interactions, it is possible to express recombinant antibodies that mimic the properties of specific naturally occurring antibodies by constructing expression vectors that include CDR sequences from the specific naturally occurring antibody grafted onto framework sequences from a different antibody with different properties (see, e.g., Riechmann, L. et al. (1988) Nature 332:323-327; Jones, P. et al. (1986) Nature 321:522-525; Queen, C. et al. (1989) Proc. Natl. Acad. Sci. U.S.A. 86:10029-10033; U.S. Pat. No. 5,225,539 to Winter, and U.S. Pat. Nos. 5,530,101; 5,585,089; 5,693,762 and 6,180,370 to Queen et al.).

Suitable framework sequences can be obtained from public DNA databases or published references that include germline antibody gene sequences. For example, germline DNA sequences for human heavy and light chain variable region genes can be found in the “VBase” human germline sequence database (available on the Internet), as well as in Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242; Tomlinson, I. M., et al. (1992) “The Repertoire of Human Germline VH Sequences Reveals about Fifty Groups of VH Segments with Different Hypervariable Loops” J. Mol. Biol. 227:776-798; and Cox, J. P. L. et al. (1994) “A Directory of Human Germ-line VH Segments Reveals a Strong Bias in their Usage” Eur. J Immunol. 24:827-836; the contents of each of which are expressly incorporated herein by reference.

Another type of variable region modification is to mutate amino acid residues within the VH and/or VL CDR1, CDR2 and/or CDR3 regions to thereby improve one or more binding properties (e.g., affinity) of the antibody of interest. Site-directed mutagenesis or PCR-mediated mutagenesis can be performed to introduce the mutations and the effect on antibody binding, or other functional property of interest, can be evaluated in appropriate in vitro or in vivo assays. Preferably conservative modifications (as discussed above) are introduced. The mutations may be amino acid substitutions, additions or deletions, but are preferably substitutions. Moreover, typically no more than one, two, three, four or five residues within a CDR region are altered.

Engineered antibodies of the invention include those in which modifications have been made to framework residues within VH and/or VL, e.g. to improve the properties of the antibody. Typically such framework modifications are made to decrease the immunogenicity of the antibody. For example, one approach is to “backmutate” one or more framework residues to the corresponding germline sequence. More specifically, an antibody that has undergone somatic mutation may contain framework residues that differ from the germline sequence from which the antibody is derived. Such residues can be identified by comparing the antibody framework sequences to the germline sequences from which the antibody is derived.
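
The comparison of framework sequences to germline sequences described above can be sketched as a position-by-position scan that lists each differing residue as a candidate backmutation. The sketch assumes the two framework sequences are already aligned and of equal length, and it numbers positions sequentially from a chosen start rather than by any standard antibody numbering scheme. The function name and toy sequences are hypothetical.

```python
def backmutation_candidates(antibody_fw, germline_fw, start=1):
    """List differences as '<antibody residue><position><germline residue>'.

    Each entry names a framework residue that could be mutated back to the
    germline residue at that position.
    """
    if len(antibody_fw) != len(germline_fw):
        raise ValueError("framework sequences must be pre-aligned to equal length")
    return [
        f"{ab}{pos}{gl}"
        for pos, (ab, gl) in enumerate(zip(antibody_fw, germline_fw), start=start)
        if ab != gl
    ]


if __name__ == "__main__":
    # Hypothetical toy framework fragments, for illustration only.
    print(backmutation_candidates("EVQLVQSGGG", "EVQLVESGGG"))
```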

In addition or alternative to modifications made within the framework or CDR regions, antibodies of the invention may be engineered to include modifications within the Fc region, typically to alter one or more functional properties of the antibody, such as serum half-life, complement fixation, Fc receptor binding, and/or antigen-dependent cellular cytotoxicity. Furthermore, an antibody of the invention may be chemically modified (e.g., one or more chemical moieties can be attached to the antibody) or be modified to alter its glycosylation, again to alter one or more functional properties of the antibody. Such embodiments are described further below. The numbering of residues in the Fc region is that of the EU index of Kabat.

In one embodiment, the hinge region of CH1 is modified such that the number of cysteine residues in the hinge region is altered, e.g., increased or decreased. This approach is described further in U.S. Pat. No. 5,677,425 by Bodmer et al. The number of cysteine residues in the hinge region of CH1 is altered to, for example, facilitate assembly of the light and heavy chains or to increase or decrease the stability of the antibody.

In another embodiment, the Fc hinge region of an antibody is mutated to decrease the biological half life of the antibody. More specifically, one or more amino acid mutations are introduced into the CH2-CH3 domain interface region of the Fc-hinge fragment such that the antibody has impaired Staphylococcal protein A (SpA) binding relative to native Fc-hinge domain SpA binding. This approach is described in further detail in U.S. Pat. No. 6,165,745 by Ward et al.

In another embodiment, the antibody is modified to increase its biological half life. Various approaches are possible. For example, one or more of the following mutations can be introduced: T252L, T254S, T256F, as described in U.S. Pat. No. 6,277,375 to Ward. Alternatively, to increase the biological half life, the antibody can be altered within the CH1 or CL region to contain a salvage receptor binding epitope taken from two loops of a CH2 domain of an Fc region of an IgG, as described in U.S. Pat. Nos. 5,869,046 and 6,121,022 by Presta et al.

In yet other embodiments, the Fc region is altered by replacing at least one amino acid residue with a different amino acid residue to alter the effector functions of the antibody. For example, one or more amino acids selected from amino acid residues 234, 235, 236, 237, 297, 318, 320 and 322 can be replaced with a different amino acid residue such that the antibody has an altered affinity for an effector ligand but retains the antigen-binding ability of the parent antibody. The effector ligand to which affinity is altered can be, for example, an Fc receptor or the C1 component of complement. This approach is described in further detail in U.S. Pat. Nos. 5,624,821 and 5,648,260, both by Winter et al.

In another example, one or more amino acids selected from amino acid residues 329, 331 and 322 can be replaced with a different amino acid residue such that the antibody has altered C1q binding and/or reduced or abolished complement dependent cytotoxicity (CDC). This approach is described in further detail in U.S. Pat. No. 6,194,551 by Idusogie et al.

In another example, one or more amino acid residues within amino acid positions 231 and 239 are altered to thereby alter the ability of the antibody to fix complement. This approach is described further in PCT Publication WO 94/29351 by Bodmer et al.

In yet another example, the Fc region is modified to increase the ability of the antibody to mediate antibody dependent cellular cytotoxicity (ADCC) and/or to increase the affinity of the antibody for an Fcγ receptor by modifying one or more amino acids at the following positions: 238, 239, 248, 249, 252, 254, 255, 256, 258, 265, 267, 268, 269, 270, 272, 276, 278, 280, 283, 285, 286, 289, 290, 292, 293, 294, 295, 296, 298, 301, 303, 305, 307, 309, 312, 315, 320, 322, 324, 326, 327, 329, 330, 331, 333, 334, 335, 337, 338, 340, 360, 373, 376, 378, 382, 388, 389, 398, 414, 416, 419, 430, 434, 435, 437, 438 or 439. This approach is described further in PCT Publication WO 00/42072 by Presta. Moreover, the binding sites on human IgG1 for FcγRI, FcγRII, FcγRIII and FcRn have been mapped and variants with improved binding have been described (see Shields, R. L. et al. (2001) J. Biol. Chem. 276:6591-6604). Specific mutations at positions 256, 290, 298, 333, 334 and 339 are shown to improve binding to FcγRIII. Additionally, the following combination mutants are shown to improve FcγRIII binding: T256A/S298A, S298A/E333A, S298A/K224A and S298A/E333A/K334A.
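The substitution labels used in this section (for example S298A/E333A/K334A) follow the pattern of wild-type residue, position, replacement residue, with combination mutants joined by a slash. A minimal sketch of reading and applying such labels follows. The function names are hypothetical, and the Fc region is represented here simply as a mapping from EU index position to residue, which is an assumption made for illustration.

```python
import re

# One substitution: wild-type letter, EU position, replacement letter.
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitutions(label: str) -> list[tuple[str, int, str]]:
    """Split a label such as 'S298A/E333A' into (wild_type, position, mutant) tuples."""
    parsed = []
    for token in label.split("/"):
        match = _SUBSTITUTION.match(token.strip())
        if match is None:
            raise ValueError(f"not a substitution label: {token!r}")
        wild_type, position, mutant = match.groups()
        parsed.append((wild_type, int(position), mutant))
    return parsed


def apply_substitutions(residues: dict[int, str], label: str) -> dict[int, str]:
    """Return a copy of `residues` (EU position -> residue) with the label applied.

    Raises ValueError if the residue present does not match the wild-type
    letter in the label, which guards against numbering mix-ups.
    """
    mutated = dict(residues)
    for wild_type, position, mutant in parse_substitutions(label):
        if mutated.get(position) != wild_type:
            raise ValueError(f"position {position} is not {wild_type}")
        mutated[position] = mutant
    return mutated
```

For example, `apply_substitutions({298: "S", 333: "E", 334: "K"}, "S298A/E333A/K334A")` yields alanine at all three positions.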

In still another embodiment, the glycosylation of an antibody is modified. For example, an aglycosylated antibody can be made (i.e., the antibody lacks glycosylation). Glycosylation can be altered to, for example, increase the affinity of the antibody for antigen. Such carbohydrate modifications can be accomplished by, for example, altering one or more sites of glycosylation within the antibody sequence. For example, one or more amino acid substitutions can be made that result in elimination of one or more variable region framework glycosylation sites to thereby eliminate glycosylation at that site. Such aglycosylation may increase the affinity of the antibody for antigen. Such an approach is described in further detail in U.S. Pat. Nos. 5,714,350 and 6,350,861 by Co et al.

Additionally or alternatively, an antibody can be made that has an altered type of glycosylation, such as a hypofucosylated antibody having reduced amounts of fucosyl residues or an antibody having increased bisecting GlcNac structures. Such altered glycosylation patterns have been demonstrated to increase the ADCC ability of antibodies. Such carbohydrate modifications can be accomplished by, for example, expressing the antibody in a host cell with altered glycosylation machinery. Cells with altered glycosylation machinery have been described in the art and can be used as host cells in which to express recombinant antibodies of the invention to thereby produce an antibody with altered glycosylation. For example, the cell lines Ms704, Ms705, and Ms709 lack the fucosyltransferase gene, FUT8 (alpha (1,6) fucosyltransferase), such that antibodies expressed in the Ms704, Ms705, and Ms709 cell lines lack fucose on their carbohydrates. The Ms704, Ms705, and Ms709 FUT8−/− cell lines are created by the targeted disruption of the FUT8 gene in CHO/DG44 cells using two replacement vectors (see U.S. Patent Publication No. 20040110704 by Yamane et al. and Yamane-Ohnuki et al. (2004) Biotechnol Bioeng 87:614-22). As another example, EP 1,176,195 by Hanai et al. describes a cell line with a functionally disrupted FUT8 gene, which encodes a fucosyl transferase, such that antibodies expressed in such a cell line exhibit hypofucosylation by reducing or eliminating the alpha 1,6 bond-related enzyme. Hanai et al. also describe cell lines which have a low enzyme activity for adding fucose to the N-acetylglucosamine that binds to the Fc region of the antibody or do not have the enzyme activity, for example the rat myeloma cell line YB2/0 (ATCC CRL 1662).
PCT Publication WO 03/035835 by Presta describes a variant CHO cell line, Lec13 cells, with reduced ability to attach fucose to Asn(297)-linked carbohydrates, also resulting in hypofucosylation of antibodies expressed in that host cell (see also Shields, R. L. et al. (2002) J. Biol. Chem. 277:26733-26740). PCT Publication WO 99/54342 by Umana et al. describes cell lines engineered to express glycoprotein-modifying glycosyl transferases (e.g., beta(1,4)-N-acetylglucosaminyltransferase III (GnTIII)) such that antibodies expressed in the engineered cell lines exhibit increased bisecting GlcNac structures which results in increased ADCC activity of the antibodies (see also Umana et al. (1999) Nat. Biotech. 17:176-180). Alternatively, the fucose residues of the antibody may be cleaved off using a fucosidase enzyme. For example, the fucosidase alpha-L-fucosidase removes fucosyl residues from antibodies (Tarentino, A. L. et al. (1975) Biochem. 14:5516-23).

Another modification of the antibodies herein that is contemplated by the invention is pegylation. An antibody can be pegylated to, for example, increase the biological (e.g., serum) half life of the antibody. To pegylate an antibody, the antibody, or fragment thereof, typically is reacted with polyethylene glycol (PEG), such as a reactive ester or aldehyde derivative of PEG, under conditions in which one or more PEG groups become attached to the antibody or antibody fragment. Preferably, the pegylation is carried out via an acylation reaction or an alkylation reaction with a reactive PEG molecule (or an analogous reactive water-soluble polymer). As used herein, the term “polyethylene glycol” is intended to encompass any of the forms of PEG that have been used to derivatize other proteins, such as mono (C1-C10) alkoxy- or aryloxy-polyethylene glycol or polyethylene glycol-maleimide. In certain embodiments, the antibody to be pegylated is an aglycosylated antibody. Methods for pegylating proteins are known in the art and can be applied to the antibodies of the invention. See for example, EP 0 154 316 by Nishimura et al. and EP 0 401 384 by Ishikawa et al.

Methods of Engineering Antibodies

As discussed above, the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies having VH and VK sequences disclosed herein can be used to create new anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies, respectively, by modifying the VH and/or VL sequences, or the constant regions attached thereto. Thus, in another aspect of the invention, the structural features of an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody of the invention are used to create structurally related anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies that retain at least one functional property of the antibodies of the invention, such as binding to human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, respectively. For example, one or more CDR regions of one VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antibody, or mutations thereof, can be combined recombinantly with known framework regions and/or other CDRs to create additional, recombinantly-engineered, anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies of the invention, as discussed above. Other types of modifications include those described in the previous section. The starting material for the engineering method is one or more of the VH and/or VK sequences provided herein, or one or more CDR regions thereof. To create the engineered antibody, it is not necessary to actually prepare (i.e., express as a protein) an antibody having one or more of the VH and/or VK sequences provided herein, or one or more CDR regions thereof. Rather, the information contained in the sequences is used as the starting material to create “second generation” sequences derived from the original sequences, and the “second generation” sequences are then prepared and expressed as a protein.

Standard molecular biology techniques can be used to prepare and express the altered antibody sequence.

Preferably, the antibody encoded by the altered antibody sequences is one that retains one, some or all of the functional properties of the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies, respectively, produced by methods and with sequences provided herein, which functional properties include binding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen with a specific KD level or less and/or modulating B7 costimulation and/or selectively binding to desired target cells, such as lung cancer, ovarian cancer, or colon cancer cells, that express VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen.

The functional properties of the altered antibodies can be assessed using standard assays available in the art and/or described herein.

In certain embodiments of the methods of engineering antibodies of the invention, mutations can be introduced randomly or selectively along all or part of an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody coding sequence and the resulting modified anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies can be screened for binding activity and/or other desired functional properties.

Mutational methods have been described in the art. For example, PCT Publication WO 02/092780 by Short describes methods for creating and screening antibody mutations using saturation mutagenesis, synthetic ligation assembly, or a combination thereof. Alternatively, PCT Publication WO 03/074679 by Lazar et al. describes methods of using computational screening methods to optimize physicochemical properties of antibodies.

Nucleic Acid Molecules Encoding Antibodies of the Invention

Another aspect of the invention pertains to nucleic acid molecules that encode the antibodies of the invention. The nucleic acids may be present in whole cells, in a cell lysate, or in a partially purified or substantially pure form. A nucleic acid is “isolated” or “rendered substantially pure” when purified away from other cellular components or other contaminants, e.g., other cellular nucleic acids or proteins, by standard techniques, including alkaline/SDS treatment, CsCl banding, column chromatography, agarose gel electrophoresis and others well known in the art. See, F. Ausubel, et al., ed. (1987) Current Protocols in Molecular Biology, Greene Publishing and Wiley Interscience, New York. A nucleic acid of the invention can be, for example, DNA or RNA and may or may not contain intronic sequences. In a preferred embodiment, the nucleic acid is a cDNA molecule.

Nucleic acids of the invention can be obtained using standard molecular biology techniques. For antibodies expressed by hybridomas (e.g., hybridomas prepared from transgenic mice carrying human immunoglobulin genes as described further below), cDNAs encoding the light and heavy chains of the antibody made by the hybridoma can be obtained by standard PCR amplification or cDNA cloning techniques. For antibodies obtained from an immunoglobulin gene library (e.g., using phage display techniques), nucleic acid encoding the antibody can be recovered from the library.

Once DNA fragments encoding VH and VL segments are obtained, these DNA fragments can be further manipulated by standard recombinant DNA techniques, for example to convert the variable region genes to full-length antibody chain genes, to Fab fragment genes or to a scFv gene. In these manipulations, a VL- or VH-encoding DNA fragment is operatively linked to another DNA fragment encoding another protein, such as an antibody constant region or a flexible linker.

The term “operatively linked”, as used in this context, is intended to mean that the two DNA fragments are joined such that the amino acid sequences encoded by the two DNA fragments remain in-frame.
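As an illustration of the in-frame requirement, the following minimal sketch joins two coding fragments only if the upstream fragment is a whole number of codons and contains no in-frame stop codon, so that the downstream fragment is read in its intended frame. The function name is hypothetical, and the sketch assumes both fragments are given as sense-strand DNA beginning at the first base of a codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_in_frame(upstream: str, downstream: str) -> str:
    """Concatenate two coding fragments, checking the reading frame is preserved.

    Assumes each fragment starts at codon position 1 on the sense strand.
    """
    upstream = upstream.upper()
    downstream = downstream.upper()
    if len(upstream) % 3 != 0:
        raise ValueError("upstream fragment is not a whole number of codons")
    codons = [upstream[i:i + 3] for i in range(0, len(upstream), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("upstream fragment contains an in-frame stop codon")
    return upstream + downstream
```

A fragment that is one or two bases short, or that carries its own stop codon, would shift or truncate translation of the downstream fragment, which is what the check is meant to catch.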

The isolated DNA encoding the VH region can be converted to a full-length heavy chain gene by operatively linking the VH-encoding DNA to another DNA molecule encoding heavy chain constant regions (CH1, CH2 and CH3). The sequences of human heavy chain constant region genes are known in the art (see e.g., Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242) and DNA fragments encompassing these regions can be obtained by standard PCR amplification. The heavy chain constant region can be an IgG1, IgG2, IgG3, IgG4, IgA, IgE, IgM or IgD constant region, but most preferably is an IgG1 or IgG4 constant region. For a Fab fragment heavy chain gene, the VH-encoding DNA can be operatively linked to another DNA molecule encoding only the heavy chain CH1 constant region.

The isolated DNA encoding the VL region can be converted to a full-length light chain gene (as well as a Fab light chain gene) by operatively linking the VL-encoding DNA to another DNA molecule encoding the light chain constant region, CL. The sequences of human light chain constant region genes are known in the art (see e.g., Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242) and DNA fragments encompassing these regions can be obtained by standard PCR amplification. The light chain constant region can be a kappa or lambda constant region, but most preferably is a kappa constant region.

To create a scFv gene, the VH- and VL-encoding DNA fragments are operatively linked to another fragment encoding a flexible linker, e.g., encoding the amino acid sequence (Gly4-Ser)3, such that the VH and VL sequences can be expressed as a contiguous single-chain protein, with the VL and VH regions joined by the flexible linker (see e.g., Bird et al. (1988) Science 242:423-426; Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85:5879-5883; McCafferty et al., (1990) Nature 348:552-554).
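At the protein level, the (Gly4-Ser)3 linker corresponds to the fifteen-residue sequence GGGGSGGGGSGGGGS. The sketch below shows how the contiguous single-chain sequence can be composed. The function name and the `vh_first` parameter are hypothetical; the domain order is left as a parameter because the text above does not fix it.

```python
# (Gly4-Ser)3 written out in one-letter amino acid code.
GLY4SER_X3 = "GGGGS" * 3


def build_scfv(vh: str, vl: str, linker: str = GLY4SER_X3, vh_first: bool = True) -> str:
    """Return the single-chain sequence with VH and VL joined by the linker.

    `vh` and `vl` are variable-domain amino acid sequences supplied by the
    caller; `vh_first` selects VH-linker-VL or VL-linker-VH orientation.
    """
    first, second = (vh, vl) if vh_first else (vl, vh)
    return first + linker + second
```

Because the linker is encoded by 45 nucleotides, a whole number of codons, inserting it between the two variable-region fragments keeps the downstream domain in frame.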

Production of Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32, or Anti-FXYD3 Monoclonal Antibodies of the Invention

Monoclonal antibodies (mAbs) of the present invention can be produced by a variety of techniques, including conventional monoclonal antibody methodology e.g., the standard somatic cell hybridization technique of Kohler and Milstein (1975) Nature 256:495. Although somatic cell hybridization procedures are preferred, in principle, other techniques for producing monoclonal antibody can be employed e.g., viral or oncogenic transformation of B lymphocytes.

A preferred animal system for preparing hybridomas is the murine system. Hybridoma production in the mouse is a very well-established procedure. Immunization protocols and techniques for isolation of immunized splenocytes for fusion are known in the art. Fusion partners (e.g., murine myeloma cells) and fusion procedures are also known.

Chimeric or humanized antibodies of the present invention can be prepared based on the sequence of a murine monoclonal antibody prepared as described above. DNA encoding the heavy and light chain immunoglobulins can be obtained from the murine hybridoma of interest and engineered to contain non-murine (e.g., human) immunoglobulin sequences using standard molecular biology techniques. For example, to create a chimeric antibody, the murine variable regions can be linked to human constant regions using methods known in the art (see e.g., U.S. Pat. No. 4,816,567 to Cabilly et al.). To create a humanized antibody, the murine CDR regions can be inserted into a human framework using methods known in the art (see e.g., U.S. Pat. No. 5,225,539 to Winter, and U.S. Pat. Nos. 5,530,101; 5,585,089; 5,693,762 and 6,180,370 to Queen et al.).

In a preferred embodiment, the antibodies of the invention are human monoclonal antibodies. Such human monoclonal antibodies directed against VSIG1 can be generated using transgenic or transchromosomic mice carrying parts of the human immune system rather than the mouse system. These transgenic and transchromosomic mice include mice referred to herein as the HuMAb Mouse® and KM Mouse®, respectively, and are collectively referred to herein as “human Ig mice.” The HuMAb Mouse™ (Medarex, Inc.) contains human immunoglobulin gene miniloci that encode unrearranged human heavy (μ and γ) and κ light chain immunoglobulin sequences, together with targeted mutations that inactivate the endogenous μ and κ chain loci (see e.g., Lonberg, et al. (1994) Nature 368(6474): 856-859). Accordingly, the mice exhibit reduced expression of mouse IgM or κ, and in response to immunization, the introduced human heavy and light chain transgenes undergo class switching and somatic mutation to generate high affinity human IgGκ monoclonal antibodies (Lonberg, N. et al. (1994), supra; reviewed in Lonberg, N. (1994) Handbook of Experimental Pharmacology 113:49-101; Lonberg, N. and Huszar, D. (1995) Intern. Rev. Immunol. 13: 65-93, and Harding, F. and Lonberg, N. (1995) Ann. N.Y. Acad. Sci. 764:536-546). The preparation and use of the HuMAb Mouse®, and the genomic modifications carried by such mice, is further described in Taylor, L. et al. (1992) Nucleic Acids Research 20:6287-6295; Chen, J. et al. (1993) International Immunology 5:647-656; Tuaillon et al. (1993) Proc. Natl. Acad. Sci. USA 90:3720-3724; Choi et al. (1993) Nature Genetics 4:117-123; Chen, J. et al. (1993) EMBO J. 12: 821-830; Tuaillon et al. (1994) J. Immunol. 152:2912-2920; Taylor, L. et al. (1994) International Immunology 6:579-591; and Fishwild, D. et al. (1996) Nature Biotechnology 14: 845-851, the contents of all of which are hereby specifically incorporated by reference in their entirety.
See further, U.S. Pat. Nos. 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,789,650; 5,877,397; 5,661,016; 5,814,318; 5,874,299; and 5,770,429; all to Lonberg and Kay; U.S. Pat. No. 5,545,807 to Surani et al.; PCT Publication Nos. WO 92/03918, WO 93/12227, WO 94/25585, WO 97/13852, WO 98/24884 and WO 99/45962, all to Lonberg and Kay; and PCT Publication No. WO 01/14424 to Korman et al.

In another embodiment, human antibodies of the invention can be raised using a mouse that carries human immunoglobulin sequences on transgenes and transchromosomes, such as a mouse that carries a human heavy chain transgene and a human light chain transchromosome. Such mice, referred to herein as “KM mice™”, are described in detail in PCT Publication WO 02/43478 to Ishida et al.

Still further, alternative transgenic animal systems expressing human immunoglobulin genes are available in the art and can be used to raise anti-VSIG1 antibodies of the invention. For example, an alternative transgenic system referred to as the Xenomouse (Abgenix, Inc.) can be used; such mice are described in, for example, U.S. Pat. Nos. 5,939,598; 6,075,181; 6,114,598; 6,150,584 and 6,162,963 to Kucherlapati et al.

Moreover, alternative transchromosomic animal systems expressing human immunoglobulin genes are available in the art and can be used to raise anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies of the invention. For example, mice carrying both a human heavy chain transchromosome and a human light chain transchromosome, referred to as “TC mice”, can be used; such mice are described in Tomizuka et al. (2000) Proc. Natl. Acad. Sci. USA 97:722-727. Furthermore, cows carrying human heavy and light chain transchromosomes have been described in the art (Kuroiwa et al. (2002) Nature Biotechnology 20:889-894) and can be used to raise anti-VSIG1 antibodies of the invention.

Human monoclonal antibodies of the invention can also be prepared using phage display methods for screening libraries of human immunoglobulin genes. Such phage display methods for isolating human antibodies are established in the art. See for example: U.S. Pat. Nos. 5,223,409; 5,403,484; and U.S. Pat. No. 5,571,698 to Ladner et al.; U.S. Pat. Nos. 5,427,908 and 5,580,717 to Dower et al.; U.S. Pat. Nos. 5,969,108 and 6,172,197 to McCafferty et al.; and U.S. Pat. Nos. 5,885,793; 6,521,404; 6,544,731; 6,555,313; 6,582,915 and 6,593,081 to Griffiths et al.

Human monoclonal antibodies of the invention can also be prepared using SCID mice into which human immune cells have been reconstituted such that a human antibody response can be generated upon immunization. Such mice are described in, for example, U.S. Pat. Nos. 5,476,996 and 5,698,767 to Wilson et al.

Immunization of Human Ig Mice

When human Ig mice are used to raise human antibodies of the invention, such mice can be immunized with a purified or enriched preparation of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen and/or recombinant VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, or a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 fusion protein, as described by Lonberg, N. et al. (1994) Nature 368(6474): 856-859; Fishwild, D. et al. (1996) Nature Biotechnology 14: 845-851; and PCT Publication WO 98/24884 and WO 01/14424. Preferably, the mice will be 6-16 weeks of age upon the first infusion. For example, a purified or recombinant preparation (5-50 μg) of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen can be used to immunize the human Ig mice intraperitoneally.

Prior experience with various antigens by others has shown that the transgenic mice respond when initially immunized intraperitoneally (IP) with antigen in complete Freund's adjuvant, followed by every other week IP immunizations (up to a total of 6) with antigen in incomplete Freund's adjuvant. However, adjuvants other than Freund's are also found to be effective. In addition, whole cells in the absence of adjuvant are found to be highly immunogenic. The immune response can be monitored over the course of the immunization protocol with plasma samples being obtained by retroorbital bleeds. The plasma can be screened by ELISA (as described below), and mice with sufficient titers of anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 human immunoglobulin can be used for fusions. Mice can be boosted intravenously with antigen 3 days before sacrifice and removal of the spleen. It is expected that 2-3 fusions for each immunization may need to be performed. Between 6 and 24 mice are typically immunized for each antigen. Usually both HCo7 and HCo12 strains are used. In addition, both HCo7 and HCo12 transgenes can be bred together into a single mouse having two different human heavy chain transgenes (HCo7/HCo12). Alternatively or additionally, the KM Mouse® strain can be used.
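The timing described above can be laid out as a simple schedule. The sketch below is illustrative only: it assumes that "every other week" means a 14-day interval and that "a total of 6" counts the initial immunization, neither of which is stated explicitly in the text, and the function names are hypothetical.

```python
def immunization_schedule(total_immunizations: int = 6, interval_days: int = 14):
    """Return (day, route, adjuvant) tuples for the IP immunization series.

    Day 0 uses complete Freund's adjuvant; later doses use incomplete
    Freund's adjuvant. The default count and interval are assumptions.
    """
    if total_immunizations < 1:
        raise ValueError("at least one immunization is required")
    schedule = [(0, "IP", "complete Freund's adjuvant")]
    for n in range(1, total_immunizations):
        schedule.append((n * interval_days, "IP", "incomplete Freund's adjuvant"))
    return schedule


def iv_boost_day(sacrifice_day: int, days_before: int = 3) -> int:
    """Day of the intravenous boost, 3 days before sacrifice by default."""
    return sacrifice_day - days_before
```

With the defaults, the IP doses fall on days 0, 14, 28, 42, 56 and 70, and a sacrifice planned for day 80 would place the intravenous boost on day 77.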

Generation of Hybridomas Producing Human Monoclonal Antibodies of the Invention

To generate hybridomas producing human monoclonal antibodies of the invention, splenocytes and/or lymph node cells from immunized mice can be isolated and fused to an appropriate immortalized cell line, such as a mouse myeloma cell line. The resulting hybridomas can be screened for the production of antigen-specific antibodies. For example, single cell suspensions of splenic lymphocytes from immunized mice can be fused to one-sixth the number of P3X63-Ag8.653 nonsecreting mouse myeloma cells (ATCC, CRL 1580) with 50% PEG. Cells are plated at approximately 2×10^5 in flat bottom microtiter plates, followed by a two week incubation in selective medium containing 20% fetal Clone Serum, 18% “653” conditioned media, 5% origen (IGEN), 4 mM L-glutamine, 1 mM sodium pyruvate, 5 mM HEPES, 0.055 mM 2-mercaptoethanol, 50 units/ml penicillin, 50 mg/ml streptomycin, 50 mg/ml gentamycin and 1×HAT (Sigma; the HAT is added 24 hours after the fusion). After approximately two weeks, cells can be cultured in medium in which the HAT is replaced with HT. Individual wells can then be screened by ELISA for human monoclonal IgM and IgG antibodies. Once extensive hybridoma growth occurs, medium can be observed usually after 10-14 days. The antibody secreting hybridomas can be replated, screened again, and if still positive for human IgG, the monoclonal antibodies can be subcloned at least twice by limiting dilution. The stable subclones can then be cultured in vitro to generate small amounts of antibody in tissue culture medium for characterization.
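The fusion ratio and the percentage components of the selective medium lend themselves to a short worked calculation. The sketch below is a hypothetical helper, not part of the described protocol: it treats the three percentage components as volume fractions and assigns the remainder to a base medium, ignoring the small volumes contributed by the millimolar and antibiotic additives.

```python
def myeloma_cells_needed(splenocyte_count: int) -> int:
    """Myeloma cells for fusion at one-sixth the number of splenic lymphocytes."""
    return round(splenocyte_count / 6)


# Percent (v/v) components named in the text above.
MEDIUM_PERCENT = {
    "fetal Clone Serum": 20,
    "653 conditioned media": 18,
    "origen": 5,
}


def medium_volumes(total_ml: float) -> dict[str, float]:
    """Volumes (ml) of each percentage component plus the remaining base medium.

    Assumption: the percentages are volume fractions of the final medium.
    """
    volumes = {name: total_ml * pct / 100 for name, pct in MEDIUM_PERCENT.items()}
    volumes["base medium"] = total_ml - sum(volumes.values())
    return volumes
```

For 1.2×10^8 splenic lymphocytes the helper returns 2×10^7 myeloma cells, and for 500 ml of medium it returns 100 ml serum, 90 ml conditioned media, 25 ml origen and 285 ml base medium.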

To purify human monoclonal antibodies, selected hybridomas can be grown in two-liter spinner-flasks for monoclonal antibody purification. Supernatants can be filtered and concentrated before affinity chromatography with protein A-Sepharose (Pharmacia, Piscataway, N.J.). Eluted IgG can be checked by gel electrophoresis and high performance liquid chromatography to ensure purity. The buffer solution can be exchanged into PBS, and the concentration can be determined by OD280 using an extinction coefficient of 1.43. The monoclonal antibodies can be aliquoted and stored at −80 degrees C.
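The OD280 determination can be written out as a one-line calculation. The sketch below assumes that the coefficient 1.43 denotes the absorbance at 280 nm of a 1 mg/ml IgG solution in a 1 cm path, which is the usual convention but is not spelled out in the text; the function name and the dilution and path-length parameters are hypothetical additions.

```python
def igg_concentration_mg_per_ml(od280: float,
                                dilution_factor: float = 1.0,
                                path_length_cm: float = 1.0,
                                extinction_coefficient: float = 1.43) -> float:
    """Estimate IgG concentration (mg/ml) from absorbance at 280 nm.

    Uses concentration = OD280 * dilution / (coefficient * path length),
    with the coefficient taken as absorbance per mg/ml per cm (assumed).
    """
    if path_length_cm <= 0 or extinction_coefficient <= 0:
        raise ValueError("path length and extinction coefficient must be positive")
    return od280 * dilution_factor / (extinction_coefficient * path_length_cm)
```

Under these assumptions, an undiluted sample reading 1.43 corresponds to about 1 mg/ml, and a tenfold-diluted sample reading 0.715 corresponds to about 5 mg/ml.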

Generation of Transfectomas Producing Monoclonal Antibodies of the Invention

Antibodies of the invention also can be produced in a host cell transfectoma using, for example, a combination of recombinant DNA techniques and gene transfection methods as is well known in the art (e.g., Morrison, S. (1985) Science 229:1202).

For example, to express the antibodies, or antibody fragments thereof, DNAs encoding partial or full-length light and heavy chains can be obtained by standard molecular biology techniques (e.g., PCR amplification or cDNA cloning using a hybridoma that expresses the antibody of interest) and the DNAs can be inserted into expression vectors such that the genes are operatively linked to transcriptional and translational control sequences. In this context, the term “operatively linked” is intended to mean that an antibody gene is ligated into a vector such that transcriptional and translational control sequences within the vector serve their intended function of regulating the transcription and translation of the antibody gene. The expression vector and expression control sequences are chosen to be compatible with the expression host cell used. The antibody light chain gene and the antibody heavy chain gene can be inserted into separate vectors or, more typically, both genes are inserted into the same expression vector. The antibody genes are inserted into the expression vector by standard methods (e.g., ligation of complementary restriction sites on the antibody gene fragment and vector, or blunt end ligation if no restriction sites are present). The light and heavy chain variable regions of the antibodies described herein can be used to create full-length antibody genes of any antibody isotype by inserting them into expression vectors already encoding heavy chain constant and light chain constant regions of the desired isotype such that the VH segment is operatively linked to the CH segments within the vector and the VK segment is operatively linked to the CL segment within the vector. Additionally or alternatively, the recombinant expression vector can encode a signal peptide that facilitates secretion of the antibody chain from a host cell.
The antibody chain gene can be cloned into the vector such that the signal peptide is linked in-frame to the amino terminus of the antibody chain gene. The signal peptide can be an immunoglobulin signal peptide or a heterologous signal peptide (i.e., a signal peptide from a non-immunoglobulin protein).

In addition to the antibody chain genes, the recombinant expression vectors of the invention carry regulatory sequences that control the expression of the antibody chain genes in a host cell. The term “regulatory sequence” is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals) that control the transcription or translation of the antibody chain genes. Such regulatory sequences are described, for example, in Goeddel (Gene Expression Technology. Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990)). It will be appreciated by those skilled in the art that the design of the expression vector, including the selection of regulatory sequences, may depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, etc. Preferred regulatory sequences for mammalian host cell expression include viral elements that direct high levels of protein expression in mammalian cells, such as promoters and/or enhancers derived from cytomegalovirus (CMV), Simian Virus 40 (SV40), adenovirus (e.g., the adenovirus major late promoter (AdMLP)) and polyoma. Alternatively, nonviral regulatory sequences may be used, such as the ubiquitin promoter or β-globin promoter. Still further, regulatory elements composed of sequences from different sources may be used, such as the SRα promoter system, which contains sequences from the SV40 early promoter and the long terminal repeat of human T cell leukemia virus type 1 (Takebe, Y. et al. (1988) Mol. Cell. Biol. 8:466-472).

In addition to the antibody chain genes and regulatory sequences, the recombinant expression vectors of the invention may carry additional sequences, such as sequences that regulate replication of the vector in host cells (e.g., origins of replication) and selectable marker genes. The selectable marker gene facilitates selection of host cells into which the vector has been introduced (see, e.g., U.S. Pat. Nos. 4,399,216, 4,634,665 and 5,179,017, all by Axel et al.). For example, typically the selectable marker gene confers resistance to drugs, such as G418, hygromycin or methotrexate, on a host cell into which the vector has been introduced. Preferred selectable marker genes include the dihydrofolate reductase (DHFR) gene (for use in dhfr− host cells with methotrexate selection/amplification) and the neo gene (for G418 selection).

For expression of the light and heavy chains, the expression vectors encoding the heavy and light chains are transfected into a host cell by standard techniques. The various forms of the term “transfection” are intended to encompass a wide variety of techniques commonly used for the introduction of exogenous DNA into a prokaryotic or eukaryotic host cell, e.g., electroporation, calcium-phosphate precipitation, DEAE-dextran transfection and the like. Although it is theoretically possible to express the antibodies of the invention in either prokaryotic or eukaryotic host cells, expression of antibodies in eukaryotic cells, and most preferably mammalian host cells, is preferred because such eukaryotic cells, and in particular mammalian cells, are more likely than prokaryotic cells to assemble and secrete a properly folded and immunologically active antibody. Prokaryotic expression of antibody genes has been reported to be ineffective for production of high yields of active antibody (Boss, M. A. and Wood, C. R. (1985) Immunology Today 6:12-13).

Preferred mammalian host cells for expressing the recombinant antibodies of the invention include Chinese Hamster Ovary (CHO) cells (including dhfr− CHO cells, described in Urlaub and Chasin, (1980) Proc. Natl. Acad. Sci. USA 77:4216-4220, used with a DHFR selectable marker, e.g., as described in R. J. Kaufman and P. A. Sharp (1982) J. Mol. Biol. 159:601-621), NS0 myeloma cells, COS cells and SP2 cells. In particular, for use with NS0 myeloma cells, another preferred expression system is the GS gene expression system disclosed in WO 87/04462, WO 89/01036 and EP 338,841. When recombinant expression vectors encoding antibody genes are introduced into mammalian host cells, the antibodies are produced by culturing the host cells for a period of time sufficient to allow for expression of the antibody in the host cells or, more preferably, secretion of the antibody into the culture medium in which the host cells are grown. Antibodies can be recovered from the culture medium using standard protein purification methods.

Characterization of Antibody Binding to Antigen

Antibodies of the invention can be tested for binding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 by, for example, standard ELISA. Briefly, microtiter plates are coated with purified VSIG1 at 0.25 μg/ml in PBS, and then blocked with 5% bovine serum albumin in PBS. Dilutions of antibody (e.g., dilutions of plasma from VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3-immunized mice) are added to each well and incubated for 1-2 hours at 37 degrees C. The plates are washed with PBS/Tween and then incubated with secondary reagent (e.g., for human antibodies, a goat-anti-human IgG Fc-specific polyclonal reagent) conjugated to alkaline phosphatase for 1 hour at 37 degrees C. After washing, the plates are developed with pNPP substrate (1 mg/ml), and analyzed at OD of 405-650. Preferably, mice which develop the highest titers will be used for fusions.
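As an illustration of how titers from such an ELISA might be compared, the following minimal sketch computes an endpoint titer from a dilution series. It assumes that “OD of 405-650” denotes a dual-wavelength reading (absorbance at 405 nm minus a 650 nm reference) and that a positivity cutoff has been chosen by the user; the function name, cutoff, and readings are hypothetical and do not come from the specification.

```python
def endpoint_titer(dilution_factors, od405, od650, cutoff):
    """Return the highest dilution factor whose reference-corrected
    absorbance (OD405 - OD650) exceeds the cutoff, or None if no well does.

    Hypothetical helper; the cutoff is an assay-specific assumption.
    """
    titer = None
    for factor, a405, a650 in zip(dilution_factors, od405, od650):
        corrected = a405 - a650  # subtract the reference wavelength reading
        if corrected > cutoff and (titer is None or factor > titer):
            titer = factor
    return titer


# Illustrative readings for one plasma sample (made-up numbers).
factors = [100, 1000, 10000]
print(endpoint_titer(factors, [1.80, 0.90, 0.15], [0.05, 0.05, 0.05], cutoff=0.2))
```

Samples with the highest endpoint titer under the same cutoff would correspond to the highest-titer mice referred to above.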

An ELISA assay as described above can also be used to screen for hybridomas that show positive reactivity with VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 immunogen. Hybridomas that bind with high avidity to VSIG1 are subcloned and further characterized. One clone from each hybridoma, which retains the reactivity of the parent cells (by ELISA), can be chosen for making a 5-10 vial cell bank stored at −140 degrees C., and for antibody purification.

To purify anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibodies, selected hybridomas can be grown in two-liter spinner-flasks for monoclonal antibody purification. Supernatants can be filtered and concentrated before affinity chromatography with protein A-sepharose (Pharmacia, Piscataway, N.J.). Eluted IgG can be checked by gel electrophoresis and high performance liquid chromatography to ensure purity. The buffer solution can be exchanged into PBS, and the concentration can be determined by OD280 using an extinction coefficient of 1.43. The monoclonal antibodies can be aliquoted and stored at −80 degrees C.
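The concentration determination can be sketched as a short Beer-Lambert calculation. This sketch assumes that the 1.43 value is the absorbance at 280 nm of a 1 mg/ml IgG solution in a 1 cm path length, which is the usual convention but is not stated explicitly in the text; the function name and the example reading are hypothetical.

```python
def igg_concentration_mg_per_ml(a280, dilution_factor=1.0,
                                extinction_coefficient=1.43,
                                path_length_cm=1.0):
    """Estimate IgG concentration (mg/ml) from an OD280 reading.

    Assumes the extinction coefficient is expressed in (mg/ml)^-1 cm^-1.
    """
    if extinction_coefficient <= 0 or path_length_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    return a280 * dilution_factor / (extinction_coefficient * path_length_cm)


# A 1:10 dilution reading 0.715 at 280 nm corresponds to about 5 mg/ml.
print(round(igg_concentration_mg_per_ml(0.715, dilution_factor=10), 2))
```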

To determine if the selected anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 monoclonal antibodies bind to unique epitopes, each antibody can be biotinylated using commercially available reagents (Pierce, Rockford, Ill.). Competition studies using unlabeled monoclonal antibodies and biotinylated monoclonal antibodies can be performed using VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3-coated ELISA plates as described above. Biotinylated mAb binding can be detected with a streptavidin-alkaline phosphatase probe.
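One common way to summarize such a competition study is as percent inhibition of the biotinylated antibody signal by each unlabeled competitor. The sketch below is an assumption about how the readout might be analyzed, not a procedure given in the specification; the function name and the absorbance values are hypothetical.

```python
def percent_inhibition(signal_with_competitor, signal_without_competitor,
                       background=0.0):
    """Percent reduction of biotinylated mAb binding caused by an
    unlabeled competitor, after background subtraction.

    High inhibition suggests overlapping epitopes; low inhibition
    suggests distinct epitopes.
    """
    uninhibited = signal_without_competitor - background
    if uninhibited <= 0:
        raise ValueError("uninhibited signal must exceed background")
    inhibited = signal_with_competitor - background
    return 100.0 * (1.0 - inhibited / uninhibited)


# Made-up absorbance values: competitor reduces signal from 1.25 to 0.35.
print(round(percent_inhibition(0.35, 1.25, background=0.05), 1))
```

Thresholds for calling two antibodies competing or non-competing are assay-specific and would need to be set empirically.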

To determine the isotype of purified antibodies, isotype ELISAs can be performed using reagents specific for antibodies of a particular isotype. For example, to determine the isotype of a human monoclonal antibody, wells of microtiter plates can be coated with 1 μg/ml of anti-human immunoglobulin overnight at 4 degrees C. After blocking with 1% BSA, the plates are reacted with 1 μg/ml or less of test monoclonal antibodies or purified isotype controls, at ambient temperature for one to two hours. The wells can then be reacted with either human IgG1 or human IgM-specific alkaline phosphatase-conjugated probes. Plates are developed and analyzed as described above.

Anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 human IgGs can be further tested for reactivity with VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen, respectively, by Western blotting. Briefly, VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen can be prepared and subjected to sodium dodecyl sulfate polyacrylamide gel electrophoresis. After electrophoresis, the separated antigens are transferred to nitrocellulose membranes, blocked with 10% fetal calf serum, and probed with the monoclonal antibodies to be tested. Human IgG binding can be detected using anti-human IgG alkaline phosphatase and developed with BCIP/NBT substrate tablets (Sigma Chem. Co., St. Louis, Mo.).

Conjugates or Immunoconjugates

The present invention encompasses conjugates for use in immune therapy comprising the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen and soluble portions thereof including the ectodomain or portions or variants thereof. For example the invention encompasses conjugates wherein the ECD of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen is attached to an immunoglobulin or fragment thereof. The invention contemplates the use thereof for promoting or inhibiting VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen activities such as immune costimulation and the use thereof in treating transplant, autoimmune, and cancer indications described herein.

In another aspect, the present invention features immunoconjugates comprising an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody, or a fragment thereof, conjugated to a therapeutic moiety, such as a cytotoxin, a drug (e.g., an immunosuppressant) or a radiotoxin. Such conjugates are referred to herein as “immunoconjugates”. Immunoconjugates that include one or more cytotoxins are referred to as “immunotoxins.” A cytotoxin or cytotoxic agent includes any agent that is detrimental to (e.g., kills) cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, teniposide, vincristine, vinblastine, colchicine, doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs or homologs thereof. Therapeutic agents also include, for example, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil, dacarbazine), alkylating agents (e.g., mechlorethamine, thiotepa, chlorambucil, melphalan, carmustine (BCNU) and lomustine (CCNU), cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) (DDP; cisplatin)), anthracyclines (e.g., daunorubicin (formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine).

Other preferred examples of therapeutic cytotoxins that can be conjugated to an antibody of the invention include duocarmycins, calicheamicins, maytansines and auristatins, and derivatives thereof. An example of a calicheamicin antibody conjugate is commercially available (Mylotarg™; Wyeth).

Cytotoxins can be conjugated to antibodies of the invention using linker technology available in the art. Examples of linker types that have been used to conjugate a cytotoxin to an antibody include, but are not limited to, hydrazones, thioethers, esters, disulfides and peptide-containing linkers. A linker can be chosen that is, for example, susceptible to cleavage by low pH within the lysosomal compartment or susceptible to cleavage by proteases, such as proteases preferentially expressed in tumor tissue such as cathepsins (e.g., cathepsins B, C, D).

For further discussion of types of cytotoxins, linkers and methods for conjugating therapeutic agents to antibodies, see also Saito, G. et al. (2003) Adv. Drug Deliv. Rev. 55:199-215; Trail, P. A. et al. (2003) Cancer Immunol. Immunother. 52:328-337; Payne, G. (2003) Cancer Cell 3:207-212; Allen, T. M. (2002) Nat. Rev. Cancer 2:750-763; Pastan, I. and Kreitman, R. J. (2002) Curr. Opin. Investig. Drugs 3:1089-1091; Senter, P. D. and Springer, C. J. (2001) Adv. Drug Deliv. Rev. 53:247-264.

Antibodies of the present invention also can be conjugated to a radioactive isotope to generate cytotoxic radiopharmaceuticals, also referred to as radioimmunoconjugates. Examples of radioactive isotopes that can be conjugated to antibodies for use diagnostically or therapeutically include, but are not limited to, iodine 131, indium 111, yttrium 90 and lutetium 177. Methods for preparing radioimmunoconjugates are established in the art. Examples of radioimmunoconjugates are commercially available, including Zevalin™ (IDEC Pharmaceuticals) and Bexxar™ (Corixa Pharmaceuticals), and similar methods can be used to prepare radioimmunoconjugates using the antibodies of the invention.

The antibody conjugates of the invention can be used to modify a given biological response, and the drug moiety is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for example, an enzymatically active toxin, or active fragment thereof, such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor or interferon-γ; or, biological response modifiers such as, for example, lymphokines, interleukin-1 (“IL-1”), interleukin-2 (“IL-2”), interleukin-6 (“IL-6”), granulocyte macrophage colony stimulating factor (“GM-CSF”), granulocyte colony stimulating factor (“G-CSF”), or other growth factors.

Techniques for conjugating such therapeutic moiety to antibodies are well known, see, e.g., Arnon et al., “Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy”, in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., “Antibodies For Drug Delivery”, in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); Thorpe, “Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review”, in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 (1985); “Analysis, Results, And Future Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy”, in Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 (Academic Press 1985), and Thorpe et al., “The Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates”, Immunol. Rev., 62:119-58 (1982).

Bispecific Molecules

In another aspect, the present invention features bispecific molecules comprising an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody, or a fragment thereof, of the invention. An antibody of the invention, or antigen-binding portions thereof, can be derivatized or linked to another functional molecule, e.g., another peptide or protein (e.g., another antibody or ligand for a receptor) to generate a bispecific molecule that binds to at least two different binding sites or target molecules. The antibody of the invention may in fact be derivatized or linked to more than one other functional molecule to generate multispecific molecules that bind to more than two different binding sites and/or target molecules; such multispecific molecules are also intended to be encompassed by the term “bispecific molecule” as used herein. To create a bispecific molecule of the invention, an antibody of the invention can be functionally linked (e.g., by chemical coupling, genetic fusion, noncovalent association or otherwise) to one or more other binding molecules, such as another antibody, antibody fragment, peptide or binding mimetic, such that a bispecific molecule results.

Accordingly, the present invention includes bispecific molecules comprising at least one first binding specificity for VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 and a second binding specificity for a second target epitope. In a particular embodiment of the invention, the second target epitope is an Fc receptor, e.g., human FcγRI (CD64) or a human Fcα receptor (CD89). Therefore, the invention includes bispecific molecules capable of binding both to FcγR, FcαR or FcεR expressing effector cells (e.g., monocytes, macrophages or polymorphonuclear cells (PMNs)), and to target cells expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3, respectively. These bispecific molecules target VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 expressing cells to effector cells and trigger Fc receptor-mediated effector cell activities, such as phagocytosis of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 expressing cells, antibody dependent cell-mediated cytotoxicity (ADCC), cytokine release, or generation of superoxide anion.

In an embodiment of the invention in which the bispecific molecule is multispecific, the molecule can further include a third binding specificity, in addition to an anti-Fc binding specificity and an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 binding specificity. In one embodiment, the third binding specificity is an anti-enhancement factor (EF) portion, e.g., a molecule which binds to a surface protein involved in cytotoxic activity and thereby increases the immune response against the target cell.

The “anti-enhancement factor portion” can be an antibody, functional antibody fragment or a ligand that binds to a given molecule, e.g., an antigen or a receptor, and thereby results in an enhancement of the effect of the binding determinants for the Fc receptor or target cell antigen. The “anti-enhancement factor portion” can bind an Fc receptor or a target cell antigen. Alternatively, the anti-enhancement factor portion can bind to an entity that is different from the entity to which the first and second binding specificities bind. For example, the anti-enhancement factor portion can bind a cytotoxic T-cell (e.g., via CD2, CD3, CD8, CD28, CD4, CD40 or ICAM-1) or other immune cell that results in an increased immune response against the target cell.

In one embodiment, the bispecific molecules of the invention comprise as a binding specificity at least one antibody, or an antibody fragment thereof, including, e.g., an Fab, Fab′, F(ab′)2, Fv, or a single chain Fv. The antibody may also be a light chain or heavy chain dimer, or any minimal fragment thereof such as a Fv or a single chain construct as described in Ladner et al. U.S. Pat. No. 4,946,778, the contents of which are expressly incorporated by reference.

In one embodiment, the binding specificity for an Fcγ receptor is provided by a monoclonal antibody, the binding of which is not blocked by human immunoglobulin G (IgG). As used herein, the term “IgG receptor” refers to any of the eight γ-chain genes located on chromosome 1. These genes encode a total of twelve transmembrane or soluble receptor isoforms which are grouped into three Fcγ receptor classes: FcγRI (CD64), FcγRII (CD32), and FcγRIII (CD16). In one preferred embodiment, the Fcγ receptor is a human high affinity FcγRI. The human FcγRI is a 72 kDa molecule, which shows high affinity for monomeric IgG (10⁸-10⁹ M⁻¹).
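Because the affinity above is given in reciprocal molar units, it is read here as an association constant of roughly 10^8 to 10^9 M^-1. The following minimal sketch converts such a value to the more familiar dissociation constant; the function name is hypothetical, and the conversion assumes simple 1:1 binding so that the dissociation constant is the reciprocal of the association constant.

```python
def association_to_dissociation_nM(ka_per_molar):
    """Convert an association constant (M^-1) to a dissociation
    constant in nanomolar, assuming 1:1 binding (Kd = 1 / Ka)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    kd_molar = 1.0 / ka_per_molar
    return kd_molar * 1e9  # molar to nanomolar


# The stated range for monomeric IgG binding to the high affinity receptor.
for ka in (1e8, 1e9):
    print(f"Ka = {ka:.0e} M^-1 -> Kd = {association_to_dissociation_nM(ka):.1f} nM")
```

Under this reading, the stated range corresponds to dissociation constants of about 10 nM down to about 1 nM.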

The production and characterization of certain preferred anti-Fcγ monoclonal antibodies are described by Fanger et al. in PCT Publication WO 88/00052 and in U.S. Pat. No. 4,954,617, the teachings of which are fully incorporated by reference herein. These antibodies bind to an epitope of FcγRI, FcγRII or FcγRIII at a site which is distinct from the Fcγ binding site of the receptor and, thus, their binding is not blocked substantially by physiological levels of IgG. Specific anti-FcγRI antibodies useful in this invention are mAb 22, mAb 32, mAb 44, mAb 62 and mAb 197. The hybridoma producing mAb 32 is available from the American Type Culture Collection, ATCC Accession No. HB9469. In other embodiments, the anti-Fcγ receptor antibody is a humanized form of monoclonal antibody 22 (H22). The production and characterization of the H22 antibody is described in Graziano, R. F. et al. (1995) J. Immunol. 155 (10): 4996-5002 and PCT Publication WO 94/10332. The H22 antibody producing cell line is deposited at the American Type Culture Collection under the designation HAO22CLI and has the accession no. CRL 11177.

In still other preferred embodiments, the binding specificity for an Fc receptor is provided by an antibody that binds to a human IgA receptor, e.g., an Fc-alpha receptor (FcαRI (CD89)), the binding of which is preferably not blocked by human immunoglobulin A (IgA). The term “IgA receptor” is intended to include the gene product of one α-gene (FcαRI) located on chromosome 19. This gene is known to encode several alternatively spliced transmembrane isoforms of 55 to 110 kDa.

FcαRI (CD89) is constitutively expressed on monocytes/macrophages, eosinophilic and neutrophilic granulocytes, but not on non-effector cell populations. FcαRI has medium affinity (approximately 5×10⁷ M⁻¹) for both IgA1 and IgA2, which is increased upon exposure to cytokines such as G-CSF or GM-CSF (Morton, H. C. et al. (1996) Critical Reviews in Immunology 16:423-440). Four FcαRI-specific monoclonal antibodies, identified as A3, A59, A62 and A77, which bind FcαRI outside the IgA ligand binding domain, have been described (Monteiro, R. C. et al. (1992) J. Immunol. 148:1764).

FcαRI and FcγRI are preferred trigger receptors for use in the bispecific molecules of the invention because they are (1) expressed primarily on immune effector cells, e.g., monocytes, PMNs, macrophages and dendritic cells; (2) expressed at high levels (e.g., 5,000-100,000 per cell); (3) mediators of cytotoxic activities (e.g., ADCC, phagocytosis); and (4) mediators of enhanced antigen presentation of antigens, including self-antigens, targeted to them.

While human monoclonal antibodies are preferred, other antibodies which can be employed in the bispecific molecules of the invention are murine, chimeric and humanized monoclonal antibodies.

The bispecific molecules of the present invention can be prepared by conjugating the constituent binding specificities, e.g., the anti-FcR and anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 binding specificities, using methods known in the art. For example, each binding specificity of the bispecific molecule can be generated separately and then conjugated to one another. When the binding specificities are proteins or peptides, a variety of coupling or cross-linking agents can be used for covalent conjugation. Examples of cross-linking agents include protein A, carbodiimide, N-succinimidyl-S-acetyl-thioacetate (SATA), 5,5′-dithiobis(2-nitrobenzoic acid) (DTNB), o-phenylenedimaleimide (oPDM), N-succinimidyl-3-(2-pyridyldithio)propionate (SPDP), and sulfosuccinimidyl 4-(N-maleimidomethyl) cyclohexane-1-carboxylate (sulfo-SMCC) (see e.g., Karpovsky et al. (1984) J. Exp. Med. 160:1686; Liu, M A et al. (1985) Proc. Natl. Acad. Sci. USA 82:8648). Other methods include those described in Paulus (1985) Behring Inst. Mitt. No. 78, 118-132; Brennan et al. (1985) Science 229:81-83; and Glennie et al. (1987) J. Immunol. 139: 2367-2375. Preferred conjugating agents are SATA and sulfo-SMCC, both available from Pierce Chemical Co. (Rockford, Ill.).

When the binding specificities are antibodies, they can be conjugated via sulfhydryl bonding of the C-terminus hinge regions of the two heavy chains. In a particularly preferred embodiment, the hinge region is modified to contain an odd number of sulfhydryl residues, preferably one, prior to conjugation.

Alternatively, both binding specificities can be encoded in the same vector and expressed and assembled in the same host cell. This method is particularly useful where the bispecific molecule is a mAbXmAb, mAbXFab, FabXF(ab′)2 or ligandXFab fusion protein. A bispecific molecule of the invention can be a single chain molecule comprising one single chain antibody and a binding determinant, or a single chain bispecific molecule comprising two binding determinants. Bispecific molecules may comprise at least two single chain molecules. Methods for preparing bispecific molecules are described for example in U.S. Pat. No. 5,260,203; U.S. Pat. No. 5,455,030; U.S. Pat. No. 4,881,175; U.S. Pat. No. 5,132,405; U.S. Pat. No. 5,091,513; U.S. Pat. No. 5,476,786; U.S. Pat. No. 5,013,653; U.S. Pat. No. 5,258,498; and U.S. Pat. No. 5,482,858.

Binding of the bispecific molecules to their specific targets can be confirmed by, for example, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), FACS analysis, bioassay (e.g., growth inhibition), or Western Blot assay. Each of these assays generally detects the presence of protein-antibody complexes of particular interest by employing a labeled reagent (e.g., an antibody) specific for the complex of interest. For example, the FcR-antibody complexes can be detected using e.g., an enzyme-linked antibody or antibody fragment which recognizes and specifically binds to the antibody-FcR complexes. Alternatively, the complexes can be detected using any of a variety of other immunoassays. For example, the antibody can be radioactively labeled and used in a radioimmunoassay (RIA) (see, for example, Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is incorporated by reference herein). The radioactive isotope can be detected by such means as the use of a gamma counter or a scintillation counter or by autoradiography.

PHARMACEUTICAL COMPOSITIONS

In another aspect, the present invention provides a composition, e.g., a pharmaceutical composition, containing one or a combination of monoclonal antibodies, or antigen-binding portions thereof, of the present invention, formulated together with a pharmaceutically acceptable carrier. Such compositions may include one or a combination of (e.g., two or more different) antibodies, or immunoconjugates or bispecific molecules of the invention. For example, a pharmaceutical composition of the invention can comprise a combination of antibodies (or immunoconjugates or bispecifics) that bind to different epitopes on the target antigen or that have complementary activities.

As discussed supra, the invention further embraces identifying other molecules such as small organic molecules, peptides, ribozymes, carbohydrates, glycoproteins, siRNAs, antisense RNAs and the like which specifically bind and/or modulate (enhance or inhibit) an activity elicited by the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen, respectively. These molecules may be identified by known screening methods such as binding assays. Typically these assays will be high throughput and will screen a large library of synthesized or native compounds in order to identify putative drug candidates that bind and/or modulate VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 related activities.

Specifically, the invention embraces the development of drugs containing the ectodomain of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen or a fragment or variant thereof or a corresponding nucleic acid sequence encoding the same. These conjugates may contain a targeting or other moiety such as an immunoglobulin domain. These conjugates may be expressed in known vector systems, or cells or vectors containing the corresponding nucleic acid sequences may be used, for cancer treatment and in immune therapy such as in the treatment of autoimmunity, transplant, GVHD, cancer, and other immune disorders or conditions.

Thus, the present invention features a pharmaceutical composition comprising a therapeutically effective amount of a therapeutic agent according to the present invention. According to the present invention the therapeutic agent could be any one of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 ectodomain, or a fragment or variant thereof, or a corresponding nucleic acid sequence encoding the same.

The pharmaceutical composition according to the present invention is further preferably used for the treatment of cancers including by way of example non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic.

The pharmaceutical composition according to the present invention is further used for the treatment of autoimmunity and preferably for treating an autoimmune disease selected from: Multiple sclerosis; Psoriasis; Rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

The pharmaceutical composition according to the present invention is preferably used for the treatment of rejection of any organ transplant and/or graft versus host disease which might develop after bone marrow transplantation.

“Treatment” refers to both therapeutic treatment and prophylactic or preventative measures. Those in need of treatment include those already with the disorder as well as those in which the disorder is to be prevented. Hence, the mammal to be treated herein may have been diagnosed as having the disorder or may be predisposed or susceptible to the disorder. “Mammal” for purposes of treatment refers to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, horses, cats, cows, etc. Preferably, the mammal is human.

The term “therapeutically effective amount” refers to an amount of agent according to the present invention that is effective to treat a disease or disorder in a mammal.

The therapeutic agents of the present invention can be provided to the subject alone, or as part of a pharmaceutical composition where they are mixed with a pharmaceutically acceptable carrier.

Pharmaceutical compositions of the invention also can be administered in combination therapy, i.e., combined with other agents. For example, the combination therapy can include an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 modulating agent according to the present invention such as a soluble polypeptide conjugate containing the ectodomain of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen or a small molecule such as a peptide, ribozyme, siRNA, or other drug that binds VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 combined with at least one other therapeutic or immune modulatory agent. Examples of therapeutic agents that can be used in combination therapy are described in greater detail below in the section on uses of the antibodies of the invention.

As used herein, “pharmaceutically acceptable carrier” includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like that are physiologically compatible. Preferably, the carrier is suitable for intravenous, intramuscular, subcutaneous, parenteral, spinal or epidermal administration (e.g., by injection or infusion). Depending on the route of administration, the active compound, i.e., antibody, immunoconjugate, or bispecific molecule, may be coated in a material to protect the compound from the action of acids and other natural conditions that may inactivate the compound. The pharmaceutical compounds of the invention may include one or more pharmaceutically acceptable salts. A “pharmaceutically acceptable salt” refers to a salt that retains the desired biological activity of the parent compound and does not impart any undesired toxicological effects (see e.g., Berge, S. M., et al. (1977) J. Pharm. Sci. 66: 1-19). Examples of such salts include acid addition salts and base addition salts. Acid addition salts include those derived from nontoxic inorganic acids, such as hydrochloric, nitric, phosphoric, sulfuric, hydrobromic, hydroiodic, phosphorous and the like, as well as from nontoxic organic acids such as aliphatic mono- and dicarboxylic acids, phenyl-substituted alkanoic acids, hydroxy alkanoic acids, aromatic acids, aliphatic and aromatic sulfonic acids and the like. Base addition salts include those derived from alkali and alkaline earth metals, such as sodium, potassium, magnesium, calcium and the like, as well as from nontoxic organic amines, such as N,N′-dibenzylethylenediamine, N-methylglucamine, chloroprocaine, choline, diethanolamine, ethylenediamine, procaine and the like.

A pharmaceutical composition of the invention also may include a pharmaceutically acceptable anti-oxidant. Examples of pharmaceutically acceptable antioxidants include: (1) water soluble antioxidants, such as ascorbic acid, cysteine hydrochloride, sodium bisulfate, sodium metabisulfite, sodium sulfite and the like; (2) oil-soluble antioxidants, such as ascorbyl palmitate, butylated hydroxyanisole (BHA), butylated hydroxytoluene (BHT), lecithin, propyl gallate, alpha-tocopherol, and the like; and (3) metal chelating agents, such as citric acid, ethylenediamine tetraacetic acid (EDTA), sorbitol, tartaric acid, phosphoric acid, and the like. Examples of suitable aqueous and nonaqueous carriers that may be employed in the pharmaceutical compositions of the invention include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl oleate. Proper fluidity can be maintained, for example, by the use of coating materials, such as lecithin, by the maintenance of the required particle size in the case of dispersions, and by the use of surfactants.

These compositions may also contain adjuvants such as preservatives, wetting agents, emulsifying agents and dispersing agents. Prevention of the presence of microorganisms may be ensured both by sterilization procedures, supra, and by the inclusion of various antibacterial and antifungal agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. It may also be desirable to include isotonic agents, such as sugars, sodium chloride, and the like into the compositions. In addition, prolonged absorption of the injectable pharmaceutical form may be brought about by the inclusion of agents which delay absorption such as aluminum monostearate and gelatin.

Pharmaceutically acceptable carriers include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. The use of such media and agents for pharmaceutically active substances is known in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in the pharmaceutical compositions of the invention is contemplated. Supplementary active compounds can also be incorporated into the compositions.

Therapeutic compositions typically must be sterile and stable under the conditions of manufacture and storage. The composition can be formulated as a solution, microemulsion, liposome, or other ordered structure suitable to high drug concentration. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. In many cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent that delays absorption, for example, monostearate salts and gelatin. Sterile injectable solutions can be prepared by incorporating the active compound in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by sterilization microfiltration. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying (lyophilization) that yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

The amount of active ingredient which can be combined with a carrier material to produce a single dosage form will vary depending upon the subject being treated, and the particular mode of administration. The amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the composition which produces a therapeutic effect. Generally, out of one hundred percent, this amount will range from about 0.01 percent to about ninety-nine percent of active ingredient, preferably from about 0.1 percent to about 70 percent, most preferably from about 1 percent to about 30 percent of active ingredient in combination with a pharmaceutically acceptable carrier.

Dosage regimens are adjusted to provide the optimum desired response (e.g., a therapeutic response). For example, a single bolus may be administered, several divided doses may be administered over time or the dose may be proportionally reduced or increased as indicated by the exigencies of the therapeutic situation. It is especially advantageous to formulate parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subjects to be treated; each unit contains a predetermined quantity of active compound calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the invention are dictated by and directly dependent on (a) the unique characteristics of the active compound and the particular therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding such an active compound for the treatment of sensitivity in individuals.

For administration of the antibody, the dosage ranges from about 0.0001 to 100 mg/kg, and more usually 0.01 to 5 mg/kg, of the host body weight. For example dosages can be 0.3 mg/kg body weight, 1 mg/kg body weight, 3 mg/kg body weight, 5 mg/kg body weight or 10 mg/kg body weight or within the range of 1-10 mg/kg. An exemplary treatment regime entails administration once per week, once every two weeks, once every three weeks, once every four weeks, once a month, once every 3 months or once every three to 6 months. Preferred dosage regimens for an anti-VSIG1 antibody of the invention include 1 mg/kg body weight or 3 mg/kg body weight via intravenous administration, with the antibody being given using one of the following dosing schedules: (i) every four weeks for six dosages, then every three months; (ii) every three weeks; (iii) 3 mg/kg body weight once followed by 1 mg/kg body weight every three weeks.
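The weight-based arithmetic behind these regimens can be illustrated with a minimal sketch. It assumes a hypothetical 70 kg subject and uses dosing schedule (iii) above (3 mg/kg once, followed by 1 mg/kg every three weeks). The function names `dose_mg` and `schedule_iii` are illustrative only and are not part of the specification.

```python
from typing import List, Tuple


def dose_mg(weight_kg: float, dose_mg_per_kg: float) -> float:
    """Convert a weight-based dose level (mg/kg) into an absolute dose (mg)."""
    if weight_kg <= 0 or dose_mg_per_kg < 0:
        raise ValueError("weight must be positive and dose level non-negative")
    return weight_kg * dose_mg_per_kg


def schedule_iii(weight_kg: float, n_maintenance: int) -> List[Tuple[int, float]]:
    """Return (week, dose in mg) pairs for schedule (iii):
    3 mg/kg once at week 0, then 1 mg/kg every three weeks."""
    doses = [(0, dose_mg(weight_kg, 3.0))]
    for i in range(1, n_maintenance + 1):
        doses.append((3 * i, dose_mg(weight_kg, 1.0)))
    return doses


# Hypothetical 70 kg subject, loading dose plus two maintenance doses.
for week, mg in schedule_iii(70.0, 2):
    print(f"week {week}: {mg:.0f} mg")
```

The sketch covers only the unit conversion and the timing of schedule (iii); actual dose selection depends on the clinical factors discussed in this section.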

In some methods, two or more monoclonal antibodies with different binding specificities are administered simultaneously, in which case the dosage of each antibody administered falls within the ranges indicated. Antibody is usually administered on multiple occasions. Intervals between single dosages can be, for example, weekly, monthly, every three months or yearly. Intervals can also be irregular as indicated by measuring blood levels of antibody to the target antigen in the patient. In some methods, dosage is adjusted to achieve a plasma antibody concentration of about 1-1000 μg/ml and in some methods about 25-300 μg/ml.

Alternatively, antibody can be administered as a sustained release formulation, in which case less frequent administration is required. Dosage and frequency vary depending on the half-life of the antibody in the patient. In general, human antibodies show the longest half life, followed by humanized antibodies, chimeric antibodies, and nonhuman antibodies. The dosage and frequency of administration can vary depending on whether the treatment is prophylactic or therapeutic. In prophylactic applications, a relatively low dosage is administered at relatively infrequent intervals over a long period of time. Some patients continue to receive treatment for the rest of their lives. In therapeutic applications, a relatively high dosage at relatively short intervals is sometimes required until progression of the disease is reduced or terminated, and preferably until the patient shows partial or complete amelioration of symptoms of disease. Thereafter, the patient can be administered a prophylactic regime.

Actual dosage levels of the active ingredients in the pharmaceutical compositions of the present invention may be varied so as to obtain an amount of the active ingredient which is effective to achieve the desired therapeutic response for a particular patient, composition, and mode of administration, without being toxic to the patient. The selected dosage level will depend upon a variety of pharmacokinetic factors including the activity of the particular compositions of the present invention employed, or the ester, salt or amide thereof, the route of administration, the time of administration, the rate of excretion of the particular compound being employed, the duration of the treatment, other drugs, compounds and/or materials used in combination with the particular compositions employed, the age, sex, weight, condition, general health and prior medical history of the patient being treated, and like factors well known in the medical arts.

A “therapeutically effective dosage” of an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32, or anti-FXYD3 antibody of the invention preferably results in a decrease in severity of disease symptoms, an increase in frequency and duration of disease symptom-free periods, an increase in lifespan, disease remission, or a prevention of impairment or disability due to the disease affliction. For example, for the treatment of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 positive tumors, e.g., lung tumors, ovarian tumors, and colon tumors, a “therapeutically effective dosage” preferably inhibits cell growth or tumor growth by at least about 20%, more preferably by at least about 40%, even more preferably by at least about 60%, and still more preferably by at least about 80% relative to untreated subjects. The ability of a compound to inhibit tumor growth can be evaluated in an animal model system predictive of efficacy in human tumors. Alternatively, this property of a composition can be evaluated by examining the ability of the compound to inhibit cell growth, such inhibition being measured in vitro by assays known to the skilled practitioner. A therapeutically effective amount of a therapeutic compound can decrease tumor size, or otherwise ameliorate symptoms in a subject. One of ordinary skill in the art would be able to determine such amounts based on such factors as the subject's size, the severity of the subject's symptoms, and the particular composition or route of administration selected.

A composition of the present invention can be administered via one or more routes of administration using one or more of a variety of methods known in the art. As will be appreciated by the skilled artisan, the route and/or mode of administration will vary depending upon the desired results. Preferred routes of administration for antibodies of the invention include intravenous, intramuscular, intradermal, intraperitoneal, subcutaneous, spinal or other parenteral routes of administration, for example by injection or infusion. The phrase “parenteral administration” as used herein means modes of administration other than enteral and topical administration, usually by injection, and includes, without limitation, intravenous, intramuscular, intraarterial, intrathecal, intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, intraarticular, subcapsular, subarachnoid, intraspinal, epidural and intrasternal injection and infusion.

Alternatively, an antibody or other VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 drug or molecule and their conjugates and combinations thereof that modulates a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen activity according to the invention can be administered via a non-parenteral route, such as a topical, epidermal or mucosal route of administration, for example, intranasally, orally, vaginally, rectally, sublingually or topically.

The active compounds can be prepared with carriers that will protect the compound against rapid release, such as a controlled release formulation, including implants, transdermal patches, and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Many methods for the preparation of such formulations are patented or generally known to those skilled in the art. See, e.g., Sustained and Controlled Release Drug Delivery Systems, J. R. Robinson, ed., Marcel Dekker, Inc., New York, 1978.

Therapeutic compositions can be administered with medical devices known in the art. For example, in a preferred embodiment, a therapeutic composition of the invention can be administered with a needleless hypodermic injection device, such as the devices disclosed in U.S. Pat. Nos. 5,399,163; 5,383,851; 5,312,335; 5,064,413; 4,941,880; 4,790,824; or 4,596,556. Examples of well-known implants and modules useful in the present invention include: U.S. Pat. No. 4,487,603, which discloses an implantable micro-infusion pump for dispensing medication at a controlled rate; U.S. Pat. No. 4,486,194, which discloses a therapeutic device for administering medicaments through the skin; U.S. Pat. No. 4,447,233, which discloses a medication infusion pump for delivering medication at a precise infusion rate; U.S. Pat. No. 4,447,224, which discloses a variable flow implantable infusion apparatus for continuous drug delivery; U.S. Pat. No. 4,439,196, which discloses an osmotic drug delivery system having multi-chamber compartments; and U.S. Pat. No. 4,475,196, which discloses an osmotic drug delivery system. These patents are incorporated herein by reference. Many other such implants, delivery systems, and modules are known to those skilled in the art.

In certain embodiments, the antibodies or other VSIG1 related drugs of the invention can be formulated to ensure proper distribution in vivo. For example, the blood-brain barrier (BBB) excludes many highly hydrophilic compounds. To ensure that the therapeutic compounds of the invention cross the BBB (if desired), they can be formulated, for example, in liposomes. For methods of manufacturing liposomes, see, e.g., U.S. Pat. Nos. 4,522,811; 5,374,548; and 5,399,331. The liposomes may comprise one or more moieties which are selectively transported into specific cells or organs, thus enhancing targeted drug delivery (see, e.g., V. V. Ranade (1989) J. Clin. Pharmacol. 29:685). Exemplary targeting moieties include folate or biotin (see, e.g., U.S. Pat. No. 5,416,016 to Low et al.); mannosides (Umezawa et al., (1988) Biochem. Biophys. Res. Commun. 153:1038); antibodies (P. G. Bloeman et al. (1995) FEBS Lett. 357:140; M. Owais et al. (1995) Antimicrob. Agents Chemother. 39:180); surfactant protein A receptor (Briscoe et al. (1995) Am. J. Physiol. 1233:134); p120 (Schreier et al. (1994) J. Biol. Chem. 269:9090); see also K. Keinanen; M. L. Laukkanen (1994) FEBS Lett. 346:123; J. J. Killion; I. J. Fidler (1994) Immunomethods 4:273.

Diagnostic Uses of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 Antigen and Corresponding Polynucleotides

According to some embodiments, the sample taken from a subject (patient) to perform the diagnostic assay according to the present invention is selected from the group consisting of a body fluid or secretion including but not limited to blood, serum, urine, plasma, prostatic fluid, seminal fluid, semen, the external secretions of the skin, respiratory, intestinal, and genitourinary tracts, tears, cerebrospinal fluid, sputum, saliva, milk, peritoneal fluid, pleural fluid, cyst fluid, secretions of the breast ductal system (and/or lavage thereof), broncho alveolar lavage, lavage of the reproductive system and lavage of any other part of the body or system in the body; samples of any organ including isolated cells or tissues, wherein the cell or tissue can be obtained from an organ selected from, but not limited to lung, colon, ovarian and/or breast tissue; stool or a tissue sample, or any combination thereof. In some embodiments, the term encompasses samples of in vivo cell culture constituents. Prior to being subjected to the diagnostic assay, the sample can optionally be diluted with a suitable eluant.

In some embodiments, the phrase “marker” in the context of the present invention refers to a nucleic acid fragment, a peptide, or a polypeptide, which is differentially present in a sample taken from patients (subjects) having one of the herein-described diseases or conditions, as compared to a comparable sample taken from subjects who do not have one of the above-described diseases or conditions.

In some embodiments, the term “polypeptide” is to be understood to refer to a molecule comprising from at least 2 to several thousand or more amino acids. The term “polypeptide” is to be understood to include, inter alia, native peptides (either degradation products, synthetically synthesized peptides or recombinant peptides), peptidomimetics, such as peptoids and semipeptoids or peptide analogs, which may comprise, for example, any desirable modification, including, inter alia, modifications rendering the peptides more stable while in a body or more capable of penetrating into cells, or others as will be appreciated by one skilled in the art. Such modifications include, but are not limited to N terminus modification, C terminus modification, peptide bond modification, backbone modifications, residue modification, or others. Inclusion of such peptides within the polypeptides of this invention may produce a polypeptide sharing identity with the polypeptides described herein, for example, those provided in the sequence listing.

In some embodiments, the phrase “differentially present” refers to differences in the quantity or quality of a marker present in a sample taken from patients having one of the herein-described diseases or conditions as compared to a comparable sample taken from patients who do not have one of the herein-described diseases or conditions. For example, a nucleic acid fragment may optionally be differentially present between the two samples if the amount of the nucleic acid fragment in one sample is significantly different from the amount of the nucleic acid fragment in the other sample, for example as measured by hybridization and/or NAT-based assays. A polypeptide is differentially present between the two samples if the amount of the polypeptide in one sample is significantly different from the amount of the polypeptide in the other sample. It should be noted that if the marker is detectable in one sample and not detectable in the other, then such a marker can be considered to be differentially present. Optionally, a relatively low amount of up-regulation may serve as the marker, as described herein. One of ordinary skill in the art could easily determine such relative levels of the markers; further guidance is provided in the description of each individual marker below.

In some embodiments, the phrase “diagnostic” means identifying the presence or nature of a pathologic condition. Diagnostic methods differ in their sensitivity and specificity. The “sensitivity” of a diagnostic assay is the percentage of diseased individuals who test positive (percent of “true positives”). Diseased individuals not detected by the assay are “false negatives.” Subjects who are not diseased and who test negative in the assay are termed “true negatives.” The “specificity” of a diagnostic assay is 1 minus the false positive rate, where the “false positive” rate is defined as the proportion of those without the disease who test positive. While a particular diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the method provides a positive indication that aids in diagnosis.
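The definitions of sensitivity and specificity given above can be expressed as a short sketch. The counts used in the example are hypothetical and serve only to illustrate the arithmetic; the function names are illustrative.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of diseased individuals who test positive."""
    diseased = true_pos + false_neg
    if diseased == 0:
        raise ValueError("no diseased individuals in the sample")
    return true_pos / diseased


def false_positive_rate(false_pos: int, true_neg: int) -> float:
    """Proportion of those without the disease who test positive."""
    non_diseased = false_pos + true_neg
    if non_diseased == 0:
        raise ValueError("no non-diseased individuals in the sample")
    return false_pos / non_diseased


def specificity(true_neg: int, false_pos: int) -> float:
    """Specificity, defined as 1 minus the false positive rate."""
    return 1.0 - false_positive_rate(false_pos, true_neg)


# Hypothetical counts: 100 diseased subjects (90 detected) and
# 100 non-diseased subjects (20 testing positive).
print(f"sensitivity = {sensitivity(90, 10):.2f}")
print(f"specificity = {specificity(80, 20):.2f}")
```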

In some embodiments, the phrase “qualitative” when in reference to differences in expression levels of a polynucleotide or polypeptide as described herein, refers to the presence versus absence of expression, or in some embodiments, the temporal regulation of expression, or in some embodiments, the timing of expression, or in some embodiments, any post-translational modifications to the expressed molecule, and others, as will be appreciated by one skilled in the art. In some embodiments, the phrase “quantitative” when in reference to differences in expression levels of a polynucleotide or polypeptide as described herein, refers to absolute differences in quantity of expression, as determined by any means, known in the art, or in other embodiments, relative differences, which may be statistically significant, or in some embodiments, when viewed as a whole or over a prolonged period of time, etc., indicate a trend in terms of differences in expression.

In some embodiments, the term “diagnosing” refers to classifying a disease or a symptom, determining a severity of the disease, monitoring disease progression, forecasting an outcome of a disease and/or prospects of recovery. The term “detecting” may also optionally encompass any of the above.

Diagnosis of a disease according to the present invention can, in some embodiments, be effected by determining a level of a polynucleotide or a polypeptide of the present invention in a biological sample obtained from the subject, wherein the level determined can be correlated with predisposition to, or presence or absence of the disease. It should be noted that a “biological sample obtained from the subject” may also optionally comprise a sample that has not been physically removed from the subject, as described in greater detail below.

In some embodiments, the term “level” refers to expression levels of RNA and/or protein or to DNA copy number of a marker of the present invention.

Typically the level of the marker in a biological sample obtained from the subject is different (i.e., increased or decreased) from the level of the same marker in a similar sample obtained from a healthy individual (examples of biological samples are described herein).

Numerous well known tissue or fluid collection methods can be utilized to collect the biological sample from the subject in order to determine the level of DNA, RNA and/or polypeptide of the marker of interest in the subject.

Examples include, but are not limited to, fine needle biopsy, needle biopsy, core needle biopsy and surgical biopsy (e.g., brain biopsy), and lavage. Regardless of the procedure employed, once a biopsy/sample is obtained the level of the marker can be determined and a diagnosis can thus be made.

Determining the level of the same marker in normal tissues of the same origin is preferably effected along-side to detect an elevated expression and/or amplification and/or a decreased expression, of the marker as opposed to the normal tissues.

In some embodiments, the term “test amount” of a marker refers to an amount of a marker in a subject's sample that is consistent with a diagnosis of a particular disease or condition. A test amount can be either in absolute amount (e.g., microgram/ml) or a relative amount (e.g., relative intensity of signals).

In some embodiments, the term “control amount” of a marker can be any amount or a range of amounts to be compared against a test amount of a marker. For example, a control amount of a marker can be the amount of a marker in a patient with a particular disease or condition or a person without such a disease or condition. A control amount can be either in absolute amount (e.g., microgram/ml) or a relative amount (e.g., relative intensity of signals).

In some embodiments, the term “detect” refers to identifying the presence, absence or amount of the object to be detected.

In some embodiments, the term “label” includes any moiety or item detectable by spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include 32P, 35S, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin-streptavidin, digoxigenin, haptens and proteins for which antisera or monoclonal antibodies are available, or nucleic acid molecules with a sequence complementary to a target. The label often generates a measurable signal, such as a radioactive, chromogenic, or fluorescent signal, that can be used to quantify the amount of bound label in a sample. The label can be incorporated in or attached to a primer or probe either covalently, or through ionic, van der Waals or hydrogen bonds, e.g., incorporation of radioactive nucleotides, or biotinylated nucleotides that are recognized by streptavidin. The label may be directly or indirectly detectable. Indirect detection can involve the binding of a second label to the first label, directly or indirectly. For example, the label can be the ligand of a binding partner, such as biotin, which is a binding partner for streptavidin, or a nucleotide sequence, which is the binding partner for a complementary sequence, to which it can specifically hybridize. The binding partner may itself be directly detectable, for example, an antibody may be itself labeled with a fluorescent molecule. The binding partner also may be indirectly detectable, for example, a nucleic acid having a complementary nucleotide sequence can be a part of a branched DNA molecule that is in turn detectable through hybridization with other labeled nucleic acid molecules (see, e.g., P. D. Fahrlander and A. Klausner, Bio/Technology 6:1165 (1988)). Quantitation of the signal is achieved by, e.g., scintillation counting, densitometry, or flow cytometry.

Exemplary detectable labels, optionally and preferably for use with immunoassays, include but are not limited to magnetic beads, fluorescent dyes, radiolabels, enzymes (e.g., horseradish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic beads. Alternatively, the marker in the sample can be detected using an indirect assay, wherein, for example, a second, labeled antibody is used to detect bound marker-specific antibody, and/or in a competition or inhibition assay wherein, for example, a monoclonal antibody which binds to a distinct epitope of the marker is incubated simultaneously with the mixture.

“Immunoassay” is an assay that uses an antibody to specifically bind an antigen. The immunoassay is characterized by the use of specific binding properties of a particular antibody to isolate, target, and/or quantify the antigen.

The phrase “specifically (or selectively) binds” to an antibody or “specifically (or selectively) immunoreactive with,” or “specifically interacts or binds” when referring to a protein or peptide (or other epitope), refers, in some embodiments, to a binding reaction that is determinative of the presence of the protein in a heterogeneous population of proteins and other biologics. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein at least two times greater than the background (non-specific signal) and do not substantially bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. For example, polyclonal antibodies raised to seminal basic protein from specific species such as rat, mouse, or human can be selected to obtain only those polyclonal antibodies that are specifically immunoreactive with seminal basic protein and not with other proteins, except for polymorphic variants and alleles of seminal basic protein. This selection may be achieved by subtracting out antibodies that cross-react with seminal basic protein molecules from other species. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays are routinely used to select antibodies specifically immunoreactive with a protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory Manual (1988), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity). Typically a specific or selective reaction will be at least twice background signal or noise and more typically more than 10 to 100 times background.
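The "at least twice background" criterion can be sketched as a simple fold-over-background check. The readings below are hypothetical absorbance values, and the names `fold_over_background` and `is_specific` are illustrative; the default threshold of 2.0 reflects the minimum stated above, while typical specific reactions are described as exceeding 10 to 100 times background.

```python
def fold_over_background(signal: float, background: float) -> float:
    """Ratio of the measured signal to the non-specific background signal."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal / background


def is_specific(signal: float, background: float, min_fold: float = 2.0) -> bool:
    """True if the signal is at least `min_fold` times the background."""
    return fold_over_background(signal, background) >= min_fold


# Hypothetical ELISA absorbance readings against a background of 0.10.
for reading in (0.15, 0.90, 4.0):
    fold = fold_over_background(reading, 0.10)
    print(f"reading {reading}: {fold:.1f}x background, specific={is_specific(reading, 0.10)}")
```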

In another embodiment, this invention provides a method for detecting the polypeptides of this invention in a biological sample, comprising: contacting a biological sample with an antibody specifically recognizing a polypeptide according to the present invention and detecting said interaction; wherein the presence of an interaction correlates with the presence of a polypeptide in the biological sample.

In some embodiments of the present invention, the polypeptides described herein are non-limiting examples of markers for diagnosing a disease and/or an indicative condition. Each marker of the present invention can be used alone or in combination, for various uses, including but not limited to, prognosis, prediction, screening, early diagnosis, determination of progression, therapy selection and treatment monitoring of a disease and/or an indicative condition.

In a related object the detected diseases will include cancers such as non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic.

In another related object the detected diseases will include autoimmune and neoplastic disorders selected from the group consisting of Multiple sclerosis; Psoriasis; Rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease; immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

In another related object the detected diseases will include rejection of any organ transplant and/or Graft versus host disease.

Each polypeptide/polynucleotide of the present invention can be used alone or in combination, for various uses, including but not limited to, prognosis, prediction, screening, early diagnosis, determination of progression, therapy selection and treatment monitoring of disease and/or an indicative condition, as detailed above.

Such a combination may optionally comprise any subcombination of markers, and/or a combination featuring at least one other marker, for example a known marker. Furthermore, such a combination may optionally and preferably be used as described above with regard to determining a ratio between a quantitative or semi-quantitative measurement of any marker described herein to any other marker described herein, and/or any other known marker, and/or any other marker.

According to further embodiments of the present invention markers of the present invention might optionally be used alone or in combination with known markers for lung cancer, including but not limited to CEA, CA15-3, Beta-2-microglobulin, CA19-9, TPA, and/or in combination with the known proteins for the variant marker as described herein.

According to further embodiments of the present invention markers of the present invention might optionally be used alone or in combination with known markers for ovarian cancer, including but not limited to CEA, CA125 (Mucin 16), CA72-4TAG, CA-50, CA 54-61, CA-195 and CA 19-9 in combination with CA-125, and/or in combination with the known proteins for the variant marker as described herein.

According to further embodiments of the present invention markers of the present invention might optionally be used alone or in combination with known markers for colon cancer, including but not limited to CEA, CA19-9, CA50, and/or in combination with the known proteins for the variant marker as described herein.

In some embodiments of the present invention, there are provided methods, uses, devices and assays for the diagnosis of a disease or condition. Optionally a plurality of markers may be used with the present invention. The plurality of markers may optionally include one or more markers described herein, and/or one or more known markers. The plurality of markers is preferably then correlated with the disease or condition. For example, such correlating may optionally comprise determining the concentration of each of the plurality of markers, and individually comparing each marker concentration to a threshold level. Optionally, if the marker concentration is above or below the threshold level (depending upon the marker and/or the diagnostic test being performed), the marker concentration correlates with the disease or condition. Optionally and preferably, a plurality of marker concentrations correlates with the disease or condition.
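
By way of non-limiting illustration only, the individual comparison of each marker concentration to a threshold level may be sketched as follows. The class MarkerRule, the function marker_flags, the marker names and all numerical values are hypothetical and are not taken from the present specification; the direction of each comparison (above or below) is assumed to be set per marker.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MarkerRule:
    """Hypothetical decision rule for a single marker (an assumption for illustration)."""
    threshold: float
    abnormal_if_above: bool  # True: elevated level is abnormal; False: reduced level is abnormal


def marker_flags(concentrations, rules):
    """Compare each marker concentration individually to its own threshold."""
    flags = {}
    for name, rule in rules.items():
        value = concentrations[name]
        if rule.abnormal_if_above:
            flags[name] = value > rule.threshold
        else:
            flags[name] = value < rule.threshold
    return flags


# Illustrative numbers only; units and cut-offs are assumptions, not measured data.
rules = {
    "marker_A": MarkerRule(threshold=5.0, abnormal_if_above=True),
    "marker_B": MarkerRule(threshold=2.0, abnormal_if_above=False),
}
sample = {"marker_A": 7.2, "marker_B": 3.1}
flags = marker_flags(sample, rules)
```

In this sketch, a marker whose flag is True would be treated as correlating with the disease or condition.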

Alternatively, such correlating may optionally comprise determining the concentration of each of the plurality of markers, calculating a single index value based on the concentration of each of the plurality of markers, and comparing the index value to a threshold level.
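
As a non-limiting sketch of the single index value approach, the index may for instance be computed as a weighted sum of the marker concentrations. The weighted-sum form, the function names and the weights are assumptions made for illustration; the specification does not prescribe any particular formula for the index.

```python
def index_value(concentrations, weights):
    """Combine several marker concentrations into one index (assumed: weighted sum)."""
    return sum(weights[name] * concentrations[name] for name in weights)


def index_correlates(concentrations, weights, threshold):
    """Compare the single index value to a threshold level."""
    return index_value(concentrations, weights) > threshold


# Hypothetical weights and concentrations, for illustration only.
weights = {"marker_A": 0.5, "marker_B": 0.25}
sample = {"marker_A": 2.0, "marker_B": 4.0}
score = index_value(sample, weights)
```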

Also alternatively, such correlating may optionally comprise determining a temporal change in at least one of the markers, and wherein the temporal change is used in the correlating step.
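
For illustration only, a temporal change in a marker may be expressed as the fractional change between the earliest and the latest measurement. This particular definition, the function names and the cut-off are assumptions; other measures of temporal change (for example a slope over several time points) could equally be used.

```python
def relative_change(measurements):
    """Fractional change between the earliest and latest measurement.

    `measurements` is a list of (time, concentration) pairs in any order.
    """
    ordered = sorted(measurements)  # sorts by time first
    baseline = ordered[0][1]
    latest = ordered[-1][1]
    if baseline == 0:
        raise ValueError("baseline concentration must be non-zero")
    return (latest - baseline) / baseline


def temporal_change_correlates(measurements, min_relative_change):
    """True if the marker rose or fell by at least the given fraction (assumed criterion)."""
    return abs(relative_change(measurements)) >= min_relative_change
```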

Also alternatively, such correlating may optionally comprise determining whether at least “X” number of the plurality of markers has a concentration outside of a predetermined range and/or above or below a threshold (as described above). The value of “X” may optionally be one marker, a plurality of markers or all of the markers; alternatively or additionally, rather than including any marker in the count for “X”, one or more specific markers of the plurality of markers may optionally be required to correlate with the disease or condition (according to a range and/or threshold).
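
The "at least X markers" criterion, optionally together with one or more specifically required markers, may be sketched as follows. The function panel_correlates and its parameters are hypothetical; the input is assumed to be a mapping from marker name to a boolean indicating whether that marker is outside its range or threshold.

```python
def panel_correlates(flags, min_count, required=()):
    """Decide whether a panel correlates with the disease or condition.

    flags:     {marker_name: bool}, True if the marker is outside its range/threshold
    min_count: the value of "X", i.e. how many markers must be flagged
    required:  markers that must be flagged regardless of the count
    """
    if any(not flags[name] for name in required):
        return False
    return sum(1 for hit in flags.values() if hit) >= min_count


# Hypothetical panel result, for illustration only.
flags = {"marker_A": True, "marker_B": False, "marker_C": True}
```

With these illustrative flags, a rule requiring any two markers would be satisfied, whereas a rule that additionally requires marker_B would not.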

Also alternatively, such correlating may optionally comprise determining whether a ratio of marker concentrations for two markers is outside a range and/or above or below a threshold. Optionally, if the ratio is above or below the threshold level and/or outside a range, the ratio correlates with the disease or condition.
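
A ratio-based correlation may be sketched, purely by way of example, as below. The function name and the range limits are assumptions; both concentrations are assumed to be expressed in the same units.

```python
def ratio_outside_range(concentration_a, concentration_b, low, high):
    """True if the ratio of two marker concentrations falls outside [low, high]."""
    if concentration_b == 0:
        raise ValueError("denominator marker concentration must be non-zero")
    ratio = concentration_a / concentration_b
    return ratio < low or ratio > high
```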

Optionally, a combination of two or more of these correlations may be used with a single panel and/or for correlating between a plurality of panels.

Optionally, the method distinguishes a disease or condition with a sensitivity of at least 70% at a specificity of at least 85% when compared to normal subjects. As used herein, sensitivity relates to the number of positive (diseased) samples detected out of the total number of positive samples present; specificity relates to the number of true negative (non-diseased) samples detected out of the total number of negative samples present. Preferably, the method distinguishes a disease or condition with a sensitivity of at least 80% at a specificity of at least 90% when compared to normal subjects. More preferably, the method distinguishes a disease or condition with a sensitivity of at least 90% at a specificity of at least 90% when compared to normal subjects. Also more preferably, the method distinguishes a disease or condition with a sensitivity of at least 70% at a specificity of at least 85% when compared to subjects exhibiting symptoms that mimic disease or condition symptoms.
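
The sensitivity and specificity as defined above may be computed from paired reference and test results, for example as sketched below. The function names are hypothetical, and the default targets simply restate the 70% sensitivity and 85% specificity figures given above; the sketch assumes the data contain at least one diseased and one non-diseased sample.

```python
def sensitivity_specificity(is_diseased, test_positive):
    """Return (sensitivity, specificity) from paired reference and test results."""
    pairs = list(zip(is_diseased, test_positive))
    true_pos = sum(1 for d, p in pairs if d and p)
    false_neg = sum(1 for d, p in pairs if d and not p)
    true_neg = sum(1 for d, p in pairs if not d and not p)
    false_pos = sum(1 for d, p in pairs if not d and p)
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


def meets_target(sensitivity, specificity, min_sensitivity=0.70, min_specificity=0.85):
    """Check a result against the minimum sensitivity/specificity figures."""
    return sensitivity >= min_sensitivity and specificity >= min_specificity


# Invented labels for illustration only: 4 diseased and 4 non-diseased samples.
is_diseased = [True, True, True, True, False, False, False, False]
test_positive = [True, True, True, False, False, False, False, True]
sens, spec = sensitivity_specificity(is_diseased, test_positive)
```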

A marker panel may be analyzed in a number of fashions well known to those of skill in the art. For example, each member of a panel may be compared to a “normal” value, or a value indicating a particular outcome. A particular diagnosis/prognosis may depend upon the comparison of each marker to this value; alternatively, if only a subset of markers is outside of a normal range, this subset may be indicative of a particular diagnosis/prognosis. The skilled artisan will also understand that diagnostic markers, differential diagnostic markers, prognostic markers, time of onset markers, disease or condition differentiating markers, etc., may be combined in a single assay or device. Markers may also be commonly used for multiple purposes by, for example, applying a different threshold or a different weighting factor to the marker for the different purposes.

In one embodiment, the panels comprise markers for the following purposes: diagnosis of a disease; diagnosis of disease and indication if the disease is in an acute phase and/or if an acute attack of the disease has occurred; diagnosis of disease and indication if the disease is in a non-acute phase and/or if a non-acute attack of the disease has occurred; indication whether a combination of acute and non-acute phases or attacks has occurred; diagnosis of a disease and prognosis of a subsequent adverse outcome; diagnosis of a disease and prognosis of a subsequent acute or non-acute phase or attack; disease progression (for example for cancer, such progression may include for example occurrence or recurrence of metastasis).

The above diagnoses may also optionally include differential diagnosis of the disease to distinguish it from other diseases, including those diseases that may feature one or more similar or identical symptoms.

In certain embodiments, one or more diagnostic or prognostic indicators are correlated to a condition or disease by merely the presence or absence of the indicators. In other embodiments, threshold levels of diagnostic or prognostic indicators can be established, and the level of the indicators in a patient sample can simply be compared to the threshold levels. The sensitivity and specificity of a diagnostic and/or prognostic test depend on more than just the analytical “quality” of the test; they also depend on the definition of what constitutes an abnormal result. In practice, Receiver Operating Characteristic curves, or “ROC” curves, are typically calculated by plotting the value of a variable versus its relative frequency in “normal” and “disease” populations, and/or by comparison of results from a subject before, during and/or after treatment.
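
By way of non-limiting example, the dependence of sensitivity and specificity on the chosen threshold may be examined by sweeping the threshold over the observed values and recording, for each threshold, the true positive rate and the false positive rate. This is the conventional ROC construction and is offered as an assumption about how such a curve might be computed; the function names are hypothetical, and higher marker values are assumed to indicate disease.

```python
def roc_points(disease_values, normal_values):
    """Return ROC points as (false positive rate, true positive rate) pairs.

    Assumes a sample is called positive when its value is >= the threshold.
    """
    thresholds = sorted(set(disease_values) | set(normal_values), reverse=True)
    points = [(0.0, 0.0)]
    for threshold in thresholds:
        tpr = sum(v >= threshold for v in disease_values) / len(disease_values)
        fpr = sum(v >= threshold for v in normal_values) / len(normal_values)
        points.append((fpr, tpr))
    return points


def area_under_curve(points):
    """Trapezoidal area under the ROC curve (1.0 = perfect separation)."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area
```

Each point on the resulting curve corresponds to one possible definition of an abnormal result, so a threshold can be selected according to the sensitivity and specificity required for a given use.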

According to embodiments of the present invention, VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 protein, polynucleotide or a fragment thereof, may be featured as a biomarker for detecting disease and/or an indicative condition, as detailed above.

According to still other embodiments, the present invention optionally and preferably encompasses any amino acid sequence or fragment thereof encoded by a nucleic acid sequence corresponding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 as described herein. Any oligopeptide or peptide relating to such an amino acid sequence or fragment thereof may optionally also (additionally or alternatively) be used as a biomarker.

In still other embodiments, the present invention provides a method for detecting a polynucleotide of this invention in a biological sample, using NAT based assays, comprising: hybridizing the isolated nucleic acid molecules or oligonucleotide fragments of at least about a minimum length to a nucleic acid material of a biological sample and detecting a hybridization complex; wherein the presence of a hybridization complex correlates with the presence of the polynucleotide in the biological sample. Non-limiting examples of methods or assays are described below.

The present invention also relates to kits based upon such diagnostic methods or assays.

Nucleic Acid Technology (NAT) Based Assays:

Detection of a nucleic acid of interest in a biological sample may also optionally be effected by NAT-based assays, which involve nucleic acid amplification technology, such as PCR for example (or variations thereof such as real-time PCR for example). As used herein, a “primer” defines an oligonucleotide which is capable of annealing to (hybridizing with) a target sequence, thereby creating a double stranded region which can serve as an initiation point for DNA synthesis under suitable conditions. Amplification of a selected, or target, nucleic acid sequence may be carried out by a number of suitable methods known in the art. Non-limiting examples of amplification techniques include polymerase chain reaction (PCR), ligase chain reaction (LCR), strand displacement amplification (SDA), transcription-based amplification, the Qβ replicase system and NASBA (Kwoh et al., 1989, Proc. Natl. Acad. Sci. USA 86, 1173-1177; Lizardi et al., 1988, BioTechnology 6:1197-1202; Malek et al., 1994, Methods Mol. Biol., 28:253-260; and Sambrook et al., 1989, supra). Non-limiting examples of Nucleic Acid Technology-based assays include PCR, Real-Time PCR, LCR, Self-Sustained Synthetic Reaction, Q-Beta Replicase, Cycling probe reaction, Branched DNA, RFLP analysis, DGGE/TGGE, Single-Strand Conformation Polymorphism, Dideoxy fingerprinting, microarrays, Fluorescence In Situ Hybridization and Comparative Genomic Hybridization. The terminology “amplification pair” (or “primer pair”) refers herein to a pair of oligonucleotides (oligos) of the present invention, which are selected to be used together in amplifying a selected nucleic acid sequence by one of a number of types of amplification processes, preferably a polymerase chain reaction. As commonly known in the art, the oligos are designed to bind to a complementary sequence under selected conditions.
In one particular embodiment, a nucleic acid sample from a patient is amplified under conditions which favor the amplification of the most abundant differentially expressed nucleic acid. In one preferred embodiment, RT-PCR is carried out on an mRNA sample from a patient under conditions which favor the amplification of the most abundant mRNA. In another preferred embodiment, the amplification of the differentially expressed nucleic acids is carried out simultaneously. It will be realized by a person skilled in the art that such methods could be adapted for the detection of differentially expressed proteins instead of differentially expressed nucleic acid sequences. The nucleic acid (i.e. DNA or RNA) for practicing the present invention may be obtained according to well known methods.

Oligonucleotide primers of the present invention may be of any suitable length, depending on the particular assay format and the particular needs and targeted genomes employed. Optionally, the oligonucleotide primers are at least 12 nucleotides in length, preferably between 15 and 24 nucleotides, and they may be adapted to be especially suited to a chosen nucleic acid amplification system. As commonly known in the art, the oligonucleotide primers can be designed by taking into consideration the melting point of hybridization thereof with its targeted sequence (Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd Edition, CSH Laboratories; Ausubel et al., 1989, in Current Protocols in Molecular Biology, John Wiley & Sons Inc., N.Y.).
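
As a non-limiting sketch of how primer length and melting point might be checked during primer design, the following uses the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius, a rough estimate commonly applied to short oligonucleotides. The choice of this rule, the function names and the example sequence are assumptions for illustration and are not taken from the specification; the length limits restate the figures given above.

```python
def wallace_tm(primer):
    """Approximate melting temperature in degrees C by the Wallace rule: 2(A+T) + 4(G+C)."""
    seq = primer.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("primer may contain only A, C, G and T")
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    return 2 * at_count + 4 * gc_count


def length_acceptable(primer, minimum=12):
    """At least 12 nucleotides, per the optional minimum stated above."""
    return len(primer) >= minimum


def length_preferred(primer, low=15, high=24):
    """Within the preferred 15 to 24 nucleotide range stated above."""
    return low <= len(primer) <= high


# Hypothetical sequence for illustration only; it is not a sequence of the invention.
example_primer = "ATGCATGCATGCATGC"
```

The two primers of an amplification pair would ordinarily be chosen so that their estimated melting temperatures are close to one another.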

Immunoassays

In another embodiment of the present invention, an immunoassay can be used to qualitatively or quantitatively detect and analyze markers in a sample. This method comprises: providing an antibody that specifically binds to a marker; contacting a sample with the antibody; and detecting the presence of a complex of the antibody bound to the marker in the sample.

To prepare an antibody that specifically binds to a marker, purified protein markers can be used. Antibodies that specifically bind to a protein marker can be prepared using any suitable methods known in the art.

After the antibody is provided, a marker can be detected and/or quantified using any of a number of well recognized immunological binding assays. Useful assays include, for example, an enzyme immune assay (EIA) such as enzyme-linked immunosorbent assay (ELISA), a radioimmune assay (RIA), a Western blot assay, or a slot blot assay (see, e.g., U.S. Pat. Nos. 4,366,241; 4,376,110; 4,517,288; and 4,837,168). Generally, a sample obtained from a subject can be contacted with the antibody that specifically binds the marker.

Optionally, the antibody can be fixed to a solid support to facilitate washing and subsequent isolation of the complex, prior to contacting the antibody with a sample. Examples of solid supports include but are not limited to glass or plastic in the form of, e.g., a microtiter plate, a stick, a bead, or a microbead. Antibodies can also be attached to a solid support.

After incubating the sample with antibodies, the mixture is washed and the antibody-marker complex formed can be detected. This can be accomplished by incubating the washed mixture with a detection reagent. Alternatively, the marker in the sample can be detected using an indirect assay, wherein, for example, a second, labeled antibody is used to detect bound marker-specific antibody, and/or in a competition or inhibition assay wherein, for example, a monoclonal antibody which binds to a distinct epitope of the marker is incubated simultaneously with the mixture.

Throughout the assays, incubation and/or washing steps may be required after each combination of reagents. Incubation steps can vary from about 5 seconds to several hours, preferably from about 5 minutes to about 24 hours. However, the incubation time will depend upon the assay format, marker, volume of solution, concentrations and the like. Usually the assays will be carried out at ambient temperature, although they can be conducted over a range of temperatures, such as 10° C. to 40° C.

The immunoassay can be used to determine a test amount of a marker in a sample from a subject. First, a test amount of a marker in a sample can be detected using the immunoassay methods described above. If a marker is present in the sample, it will form an antibody-marker complex with an antibody that specifically binds the marker under suitable incubation conditions described above. The amount of an antibody-marker complex can optionally be determined by comparing to a standard. As noted above, the test amount of marker need not be measured in absolute units, as long as the unit of measurement can be compared to a control amount and/or signal.

Radio-immunoassay (RIA): In one version, this method involves precipitation of the desired substrate, here and in the methods detailed herein below, with a specific antibody and a radiolabeled antibody binding protein (e.g., protein A labeled with I-125) immobilized on a precipitable carrier such as agarose beads. The number of counts in the precipitated pellet is proportional to the amount of substrate.

In an alternate version of the RIA, a labeled substrate and an unlabelled antibody binding protein are employed. A sample containing an unknown amount of substrate is added in varying amounts. The decrease in precipitated counts from the labeled substrate is proportional to the amount of substrate in the added sample.

Enzyme linked immunosorbent assay (ELISA): This method involves fixation of a sample (e.g., fixed cells or a proteinaceous solution) containing a protein substrate to a surface such as a well of a microtiter plate. A substrate specific antibody coupled to an enzyme is applied and allowed to bind to the substrate. Presence of the antibody is then detected and quantitated by a colorimetric reaction employing the enzyme coupled to the antibody. Enzymes commonly employed in this method include horseradish peroxidase and alkaline phosphatase. If well calibrated and within the linear range of response, the amount of substrate present in the sample is proportional to the amount of color produced. A substrate standard is generally employed to improve quantitative accuracy.
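
By way of illustration only, quantitation against a substrate standard within the linear range of response may be sketched as a straight-line fit of signal against known standard concentrations, followed by inversion of that line for the unknown sample. The least-squares fit, the function names and the numerical values are assumptions; real assays may require a non-linear (for example four-parameter) standard curve.

```python
def fit_line(concentrations, signals):
    """Ordinary least-squares fit: signal = slope * concentration + intercept."""
    n = len(concentrations)
    mean_x = sum(concentrations) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in concentrations)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(concentrations, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def in_linear_range(signal, standard_signals):
    """Only interpolate within the span of the standards; do not extrapolate."""
    return min(standard_signals) <= signal <= max(standard_signals)


def concentration_from_signal(signal, slope, intercept):
    """Invert the standard line to estimate the substrate concentration."""
    return (signal - intercept) / slope


# Invented standard curve for illustration only (arbitrary units).
standard_conc = [0.0, 1.0, 2.0, 3.0]
standard_signal = [0.1, 0.3, 0.5, 0.7]
slope, intercept = fit_line(standard_conc, standard_signal)
estimate = concentration_from_signal(0.4, slope, intercept)
```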

Western blot: This method involves separation of a substrate from other protein by means of an acrylamide gel followed by transfer of the substrate to a membrane (e.g., nylon or PVDF). Presence of the substrate is then detected by antibodies specific to the substrate, which are in turn detected by antibody binding reagents. Antibody binding reagents may be, for example, protein A, or other antibodies. Antibody binding reagents may be radiolabeled or enzyme linked as described hereinabove. Detection may be by autoradiography, colorimetric reaction or chemiluminescence. This method allows both quantitation of an amount of substrate and determination of its identity by a relative position on the membrane which is indicative of a migration distance in the acrylamide gel during electrophoresis.

Immunohistochemical analysis: This method involves detection of a substrate in situ in fixed cells by substrate specific antibodies. The substrate specific antibodies may be enzyme linked or linked to fluorophores. Detection is by microscopy and subjective evaluation. If enzyme linked antibodies are employed, a colorimetric reaction may be required.

Fluorescence activated cell sorting (FACS): This method involves detection of a substrate in situ in cells by substrate specific antibodies. The substrate specific antibodies are linked to fluorophores. Detection is by means of a cell sorting machine which reads the wavelength of light emitted from each cell as it passes through a light beam. This method may employ two or more antibodies simultaneously.

Radio-Imaging Methods

These methods include, but are not limited to, positron emission tomography (PET) and single photon emission computed tomography (SPECT). Both of these techniques are non-invasive, and can be used to detect and/or measure a wide variety of tissue events and/or functions, such as detecting cancerous cells for example. Unlike PET, SPECT can optionally be used with two labels simultaneously. SPECT has some other advantages as well, for example with regard to cost and the types of labels that can be used. For example, U.S. Pat. No. 6,696,686 describes the use of SPECT for detection of breast cancer, and is hereby incorporated by reference as if fully set forth herein.

Theranostics:

The term theranostics describes the use of diagnostic testing to diagnose the disease, choose the correct treatment regime according to the results of diagnostic testing and/or monitor the patient response to therapy according to the results of diagnostic testing. Theranostic tests can be used to select patients for treatments that are particularly likely to benefit them and unlikely to produce side-effects. They can also provide an early and objective indication of treatment efficacy in individual patients, so that (if necessary) the treatment can be altered with a minimum of delay. For example, DAKO and Genentech together created HercepTest and Herceptin (trastuzumab) for the treatment of breast cancer, the first theranostic test approved simultaneously with a new therapeutic drug. In addition to HercepTest (which is an immunohistochemical test), other theranostic tests are in development which use traditional clinical chemistry, immunoassay, cell-based technologies and nucleic acid tests. Another example is PPGx's recently launched TPMT (thiopurine S-methyltransferase) test, which is enabling doctors to identify patients at risk for potentially fatal adverse reactions to 6-mercaptopurine, an agent used in the treatment of leukemia. Also, Nova Molecular pioneered SNP genotyping of the apolipoprotein E gene to predict Alzheimer's disease patients' responses to cholinomimetic therapies and it is now widely used in clinical trials of new drugs for this indication. Thus, the field of theranostics represents the intersection of diagnostic testing information that predicts the response of a patient to a treatment with the selection of the appropriate treatment for that particular patient.

Surrogate Markers:

A surrogate marker is a marker that is detectable in a laboratory and/or according to a physical sign or symptom on the patient, and that is used in therapeutic trials as a substitute for a clinically meaningful endpoint. The clinically meaningful endpoint is a direct measure of how a patient feels, functions, or survives; the surrogate marker is expected to predict the effect of the therapy on that endpoint. The need for surrogate markers mainly arises when such markers can be measured earlier, more conveniently, or more frequently than the endpoints of interest in terms of the effect of a treatment on a patient, which are referred to as the clinical endpoints. Ideally, a surrogate marker should be biologically plausible, predictive of disease progression and measurable by standardized assays (including but not limited to traditional clinical chemistry, immunoassay, cell-based technologies, nucleic acid tests and imaging modalities).

Surrogate endpoints were used first mainly in the cardiovascular area. For example, antihypertensive drugs have been approved based on their effectiveness in lowering blood pressure. Similarly, in the past, cholesterol-lowering agents have been approved based on their ability to decrease serum cholesterol, not on the direct evidence that they decrease mortality from atherosclerotic heart disease. The measurement of cholesterol levels is now an accepted surrogate marker of atherosclerosis. In addition, currently two commonly used surrogate markers in HIV studies are CD4+ T cell counts and quantitative plasma HIV RNA (viral load). In some embodiments of this invention, the polypeptide/polynucleotide expression pattern may serve as a surrogate marker for a particular disease, as will be appreciated by one skilled in the art.

Uses and Methods of the Invention

The VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 drugs according to the invention, especially antibodies, particularly the human antibodies, antibody compositions, and soluble conjugates containing the ectodomain of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen or a fragment or variant thereof, or a corresponding nucleic acid sequence or vector or cell expressing same and methods of the present invention have numerous in vitro and in vivo diagnostic and therapeutic utilities involving the diagnosis and treatment of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen related disorders and/or disorders wherein modulation of immune co-stimulation e.g., involving B7-related immune costimulation involving VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen is therapeutically desirable. As noted these conditions include in particular cancers that differentially express the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen such as lung cancer, ovarian cancer, colon cancer, including invasive and metastatic forms thereof, and/or autoimmune conditions wherein modulation of costimulation such as involving B7 is therapeutically desirable. The subject anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies may prevent B7 mediated negative stimulation of T cell activity against cancer cells and/or prevent positive stimulation of T cell activity. 
Such antibodies may be used in the treatment of conditions including cancers such as non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic as well as non-malignant disorders such as immune disorders including but not limited to transplant rejection and graft versus host disease, and autoimmune disorders such as afore-mentioned.

For example, these molecules can be administered to cells in culture, in vitro or ex vivo, or to human subjects, e.g., in vivo, to treat, prevent and to diagnose a variety of disorders. Preferred subjects include human patients having disorders mediated by cells expressing the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 antigen and cells that possess VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 activity. The methods are particularly suitable for treating human patients having a disorder associated with aberrant VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen expression using antibodies that specifically bind AI581519_P3 (SEQ ID NO:11), AI581519_P4 (SEQ ID NO:12), AI581519_P5 (SEQ ID NO:13), AI581519_P7 (SEQ ID NO:14), AI581519_P9 (SEQ ID NO:15), AI581519_P10 (SEQ ID NO:16), AA424839_P3 (SEQ ID NO:22), AA424839_P5 (SEQ ID NO:21), AA424839_P7 (SEQ ID NO:23), or AA424839_1_P11 (SEQ ID NO:24), H68654_1_P2 (SEQ ID NO:35), H68654_1_P5 (SEQ ID NO:36), H68654_1_P7 (SEQ ID NO:37), H68654_1_P12 (SEQ ID NO:38), H68654_1_P13 (SEQ ID NO:39), H68654_1_P14 (SEQ ID NO:40), AI216611_P0 (SEQ ID NO:43), AI216611_P1 (SEQ ID NO:44), H19011_1_P8 (SEQ ID NO:48), H19011_1_P9 (SEQ ID NO:50), R31375_P0 (SEQ ID NO:70), R31375_P14 (SEQ ID NO:72), R31375_P31 (SEQ ID NO:73) or R31375_P33 (SEQ ID NO:74).

When VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, or FXYD3 drugs according to the invention are administered together with another agent, the two can be administered in either order or simultaneously.

Given the specific binding of the antibodies of the invention for VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, the antibodies of the invention can be used to specifically detect VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 expression on the surface of cells and, moreover, can be used to purify VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen via immunoaffinity purification.

Furthermore, given the expression of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 on various tumor cells, the human antibodies, antibody compositions and methods of the present invention can be used to treat a subject with a tumorigenic disorder, e.g., a disorder characterized by the presence of tumor cells expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen such as lung cancer and ovarian cancer, as mentioned.

In one embodiment, the antibodies (e.g., human monoclonal antibodies, multispecific and bispecific molecules and compositions) of the invention can be used to detect levels of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 or levels of cells which contain VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, respectively, on their membrane surface, which levels can then be linked to certain disease symptoms. Alternatively, the antibodies can be used to inhibit or block VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 function which, in turn, can be linked to the prevention or amelioration of certain disease symptoms, thereby implicating VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, respectively, as a mediator of the disease. This can be achieved by contacting a sample and a control sample with the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibody under conditions that allow for the formation of a complex between the corresponding antibody and VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, respectively. Any complexes formed between the antibody and VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 are detected and compared in the sample and the control.

In another embodiment, the antibodies (e.g., human antibodies, multispecific and bispecific molecules and compositions) of the invention can be initially tested for binding activity associated with therapeutic or diagnostic use in vitro. For example, compositions of the invention can be tested using flow cytometric assays.

The antibodies (e.g., human antibodies, multispecific and bispecific molecules, immunoconjugates and compositions) of the invention have additional utility in therapy and diagnosis of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3-related diseases. For example, the human monoclonal antibodies, the multispecific or bispecific molecules and the immunoconjugates can be used to elicit in vivo or in vitro one or more of the following biological activities: to inhibit the growth of and/or kill a cell expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3; to mediate phagocytosis or ADCC of a cell expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 in the presence of human effector cells, or to block VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 ligand binding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, respectively.

In a particular embodiment, the antibodies (e.g., human antibodies, multispecific and bispecific molecules and compositions) are used in vivo to treat, prevent or diagnose a variety of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3-related diseases. Examples of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3-related diseases include, among others, cancer, such as lung cancer, ovarian cancer, colon cancer, other non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic. Additional examples of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3-related diseases include, among others, non-malignant disorders such as immune disorders including but not limited to autoimmune diseases, transplant rejection and graft versus host disease. 
Such disorders include by way of example autoimmune diseases selected from multiple sclerosis; psoriasis; rheumatoid arthritis; Systemic lupus erythematosus; Ulcerative colitis; Crohn's disease, immune disorders associated with graft transplantation rejection, benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.

Suitable routes of administering the antibody compositions (e.g., human monoclonal antibodies, multispecific and bispecific molecules and immunoconjugates) of the invention in vivo and in vitro are well known in the art and can be selected by those of ordinary skill. For example, the antibody compositions can be administered by injection (e.g., intravenous or subcutaneous). Suitable dosages of the molecules used will depend on the age and weight of the subject and the concentration and/or formulation of the antibody composition.

As previously described, human anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies of the invention can be co-administered with one or more other therapeutic agents, e.g., a cytotoxic agent, a radiotoxic agent or an immunosuppressive agent. The antibody can be linked to the agent (as an immunocomplex) or can be administered separately from the agent. In the latter case (separate administration), the antibody can be administered before, after or concurrently with the agent or can be co-administered with other known therapies, e.g., an anti-cancer therapy, e.g., radiation. Such therapeutic agents include, among others, anti-neoplastic agents such as doxorubicin (adriamycin), cisplatin, bleomycin sulfate, carmustine, chlorambucil, cyclophosphamide and hydroxyurea which, by themselves, are only effective at levels which are toxic or subtoxic to a patient. Cisplatin is intravenously administered as a 100 mg/dose once every four weeks and adriamycin is intravenously administered as a 60-75 mg/ml dose once every 21 days. Co-administration of the human anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies, or antigen binding fragments thereof, of the present invention with chemotherapeutic agents provides two anti-cancer agents which operate via different mechanisms which yield a cytotoxic effect to human tumor cells. Such co-administration can solve problems due to development of resistance to drugs or a change in the antigenicity of the tumor cells which would render them unreactive with the antibody.

Target-specific effector cells, e.g., effector cells linked to compositions (e.g., human antibodies, multispecific and bispecific molecules) of the invention can also be used as therapeutic agents. Effector cells for targeting can be human leukocytes such as macrophages, neutrophils or monocytes. Other cells include eosinophils, natural killer cells and other IgG- or IgA-receptor bearing cells. If desired, effector cells can be obtained from the subject to be treated. The target-specific effector cells can be administered as a suspension of cells in a physiologically acceptable solution. The number of cells administered can be in the order of 10^8 to 10^9 but will vary depending on the therapeutic purpose. In general, the amount will be sufficient to obtain localization at the target cell, e.g., a tumor cell expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 and to effect cell killing by, e.g., phagocytosis. Routes of administration can also vary.

Therapy with target-specific effector cells can be performed in conjunction with other techniques for removal of targeted cells. For example, anti-tumor therapy using the compositions (e.g., human antibodies, multispecific and bispecific molecules) of the invention and/or effector cells armed with these compositions can be used in conjunction with chemotherapy. Additionally, combination immunotherapy may be used to direct two distinct cytotoxic effector populations toward tumor cell rejection. For example, anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies linked to anti-Fc-gamma RI or anti-CD3 may be used in conjunction with IgG- or IgA-receptor specific binding agents.

Bispecific and multispecific molecules of the invention can also be used to modulate FcgammaR levels on effector cells, such as by capping and elimination of receptors on the cell surface. Mixtures of anti-Fc receptors can also be used for this purpose.

The compositions (e.g., human antibodies, multispecific and bispecific molecules and immunoconjugates) of the invention which have complement binding sites, such as portions from IgG1, -2, or -3 or IgM which bind complement, can also be used in the presence of complement. In one embodiment, ex vivo treatment of a population of cells comprising target cells with a binding agent of the invention and appropriate effector cells can be supplemented by the addition of complement or serum containing complement. Phagocytosis of target cells coated with a binding agent of the invention can be improved by binding of complement proteins. In another embodiment target cells coated with the compositions (e.g., human antibodies, multispecific and bispecific molecules) of the invention can also be lysed by complement. In yet another embodiment, the compositions of the invention do not activate complement.

The compositions (e.g., human antibodies, multispecific and bispecific molecules and immunoconjugates) of the invention can also be administered together with complement. Accordingly, within the scope of the invention are compositions comprising human antibodies, multispecific or bispecific molecules and serum or complement. These compositions are advantageous in that the complement is located in close proximity to the human antibodies, multispecific or bispecific molecules. Alternatively, the human antibodies, multispecific or bispecific molecules of the invention and the complement or serum can be administered separately.

Also within the scope of the present invention are kits comprising the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 conjugates or antibody compositions of the invention (e.g., human antibodies, bispecific or multispecific molecules, or immunoconjugates) and instructions for use. The kit can further contain one or more additional reagents, such as an immunosuppressive reagent, a cytotoxic agent or a radiotoxic agent, or one or more additional human antibodies of the invention (e.g., a human antibody having a complementary activity which binds to an epitope in the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen distinct from the first human antibody).

Accordingly, patients treated with antibody compositions of the invention can be additionally administered (prior to, simultaneously with, or following administration of a human antibody of the invention) with another therapeutic agent, such as a cytotoxic or radiotoxic agent, which enhances or augments the therapeutic effect of the human antibodies.

In other embodiments, the subject can be additionally treated with an agent that modulates, e.g., enhances or inhibits, the expression or activity of Fc-gamma receptors by, for example, treating the subject with a cytokine. Preferred cytokines for administration during treatment with the multispecific molecule include granulocyte colony-stimulating factor (G-CSF), granulocyte-macrophage colony-stimulating factor (GM-CSF), interferon-gamma (IFN-gamma), and tumor necrosis factor (TNF).

The compositions (e.g., human antibodies, multispecific and bispecific molecules) of the invention can also be used to target cells expressing Fc gamma R or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, for example for labeling such cells. For such use, the binding agent can be linked to a molecule that can be detected. Thus, the invention provides methods for localizing ex vivo or in vitro cells expressing Fc receptors, such as FcgammaR, or VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen. The detectable label can be, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor.

In a particular embodiment, the invention provides methods for detecting the presence of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen in a sample, or measuring the amount of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, respectively, comprising contacting the sample, and a control sample, with a human monoclonal antibody, or an antigen binding portion thereof, which specifically binds to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, respectively, under conditions that allow for formation of a complex between the antibody or portion thereof and VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3. The formation of a complex is then detected, wherein a difference in complex formation between the sample and the control sample is indicative of the presence of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen in the sample. As noted, the invention in particular embraces assays for detecting VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen in vitro and in vivo such as immunoassays, radioimmunoassays, radioassays, radioimaging assays, ELISAs, Western blot, FACS, slot blot, immunohistochemical assays, and other assays well known to those skilled in the art.
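
By way of illustration only, the comparison of complex formation between a sample and a control sample can be sketched in Python as follows. The function name, the replicate readings and the cutoff rule (control mean plus three standard deviations) are assumptions introduced for this example and are not specified by the invention; any suitable statistical criterion may be substituted.

```python
from statistics import mean, stdev


def antigen_detected(sample_signals, control_signals, n_sd=3.0):
    """Compare assay signal (e.g., ELISA absorbance) of a sample with a control.

    Hypothetical rule, assumed for illustration: the antigen is called
    present when the mean sample signal exceeds the control mean by more
    than n_sd control standard deviations.
    """
    if len(control_signals) < 2:
        raise ValueError("need at least two control replicates")
    if not sample_signals:
        raise ValueError("need at least one sample reading")
    cutoff = mean(control_signals) + n_sd * stdev(control_signals)
    return mean(sample_signals) > cutoff


if __name__ == "__main__":
    # Illustrative absorbance readings, not experimental data.
    control = [0.10, 0.12, 0.11]
    sample = [1.20, 1.30, 1.25]
    print(antigen_detected(sample, control))
```

The cutoff is derived from the control replicates alone, so the same function can be reused for each of the assay formats listed above provided that the readout is numeric.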

In other embodiments, the invention provides methods for treating a VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 mediated disorder in a subject, e.g., cancer, such as non-solid and solid tumors, sarcomas, hematological malignancies including but not limited to acute lymphocytic leukemia, chronic lymphocytic leukemia, acute myelogenous leukemia, chronic myelogenous leukemia, multiple myeloma, Hodgkin's lymphoma, Non-Hodgkin's lymphoma, cancer of the breast, prostate, lung, ovary, colon, spleen, kidney, bladder, head and neck, uterus, testicles, stomach, cervix, liver, bone, skin, pancreas, brain and wherein the cancer may be non-metastatic, invasive or metastatic, as well as non-malignant disorders such as immune disorders including but not limited to transplant rejection and graft versus host disease, or an autoimmune disease selected from those aforementioned, and methods of treating any condition wherein modulation of immune costimulation that involves VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 is therapeutically desirable, using anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies or soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen conjugates or other drugs that target and modulate (promote or inhibit) one or more VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 biological activities.

By administering the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibody, soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen conjugate or other drug that targets the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or a portion thereof to a subject, the ability of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen to induce such activities is inhibited or promoted and, thus, the associated disorder is treated. The soluble VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or antigen conjugate or anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibody or fragment containing composition or other drug that targets and modulates VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, can be administered alone or along with another therapeutic agent, such as a cytotoxic or a radiotoxic agent which acts in conjunction with or synergistically with the antibody composition to treat or prevent the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen mediated disease.

In yet another embodiment, immunoconjugates of the invention can be used to target compounds (e.g., therapeutic agents, labels, cytotoxins, radiotoxins, immunosuppressants, etc.) to cells which have VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 cell surface receptors by linking such compounds to the antibody. Thus, the invention also provides methods for localizing ex vivo or in vivo cells expressing VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 (e.g., with a detectable label, such as a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor). Alternatively, the immunoconjugates can be used to kill cells which have VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 cell surface receptors by targeting cytotoxins or radiotoxins to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen.

The present invention is further illustrated by the following sequence characterization of a DNA transcript encoding the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, its domains and expression data in normal and cancerous tissues, as well as prophetic examples describing the manufacture of fully human antibodies thereto. This information and these examples are illustrative and should not be construed as further limiting. The contents of all figures and all references, patents and published patent applications cited throughout this application are expressly incorporated herein by reference.

EXAMPLES

Example 1

Methods Used to Analyze the Expression of the RNA Encoding the Proteins of the Invention

The targets of the present invention were tested with regard to their expression in various cancerous and non-cancerous tissue samples and/or with regard to their expression in a wide panel of human samples which contains various types of immune cells, hematological malignancy samples and cell lines, as well as several samples of normal tissues. The list of the blood specific RNA samples used for the qRT-PCR analysis is provided in Table 1 below. A description of the samples used in the normal tissue panel is provided in Table 2. A description of the samples used in the lung cancer testing panel is provided in Table 3 below. A description of the samples used in the ovary cancer testing panel is provided in Table 4 below. A description of the samples used in the colon cancer testing panel is provided in Table 5 below. The keys for Tables 3, 4 and 5 are given in Tables 3_1, 4_1 and 5_1, respectively. Tests were then performed as described in the “Materials and Experimental Procedures” section below.

TABLE 1 Samples in blood specific panel

Blood panel sample | Description | Organ/Cell type | Tumor Type
1_PBMC2 | PBMCs | blood-derived cells |
2_PBMC3 | PBMCs | blood-derived cells |
3_Bcell1 | B cells | blood-derived cells |
4_Bcell2 | B cells | blood-derived cells |
5_J_Bcell | B cells | blood-derived cells |
6_K_Bcells_act | Bcells activated | blood-derived cells |
7_Tcell1 | T cells | blood-derived cells |
8_Tcell2 | T cells | blood-derived cells |
9_M_CD8 | CD4+ T cells | blood-derived cells |
10_G_CD4_unt | CD8+ T cells | blood-derived cells |
11_H_CD4_Beads | CD4+ w Activation beads | blood-derived cells |
12_I_CD4_Beads_IL12 | CD4 w act. Beads + IL12 | blood-derived cells |
13_95_CD4+CD25− | CD4+CD25− | blood-derived cells |
15_NK | NK cells | blood-derived cells |
16_CD34+_1548 | CD34+(PCBM1548) | blood-derived cells |
17_CD34+_1028 | CD34+(PCBM1028) | blood-derived cells |
18_PMN | PMNs | blood-derived cells |
19_A_Mono | Monocytes | blood-derived cells |
20_B_Macro_imma | Macrophages immature | blood-derived cells |
21_C_Macro_mat | Macrophages mature | blood-derived cells |
22_D_DCs_immat | DCs immature | blood-derived cells |
23_E_DCs_mat_LPS | DCs mature LPS | blood-derived cells |
24_F_DCs_mat_CK | DCs mature CK | blood-derived cells |
25_L_DCs + T | DCs + T cells | blood-derived cells |
26_Lym1 | 13987A1 | Lymph Node | Lymphoma
27_Lym2 | 43594B1 | Muscle | lymphoma
28_Lym3 | 65493A1 | Testis | Lymphoma
29_MalLym3 | 75894A1 | Brain | Lymphoma
30_NonHod_SCLym | 83325A1 | Lymph Node | NHL Small Cell
31_NonHod_FolLym | 76943A1(5 tubes) | Lymph Node | NHL Follicular
32_Lym_Fol_GI | CN_4_ASRBNA35 | | NHL Follicular Grade I (Small Cell)
33_Lym_Fol_GII | CN_1_113GHA8J | | NHL Follicular Grade II (mixed Small & Large Cell)
34_Lym_Fol_GIII | CN_8_VXML6AXI | | NHL Follicular Grade III (Large Cell)
35_MalLym1 | 76218B1 | Testis | NHL Large Cell
36_MalLym2 | 76102A1 | Lymph Node | NHL Large Cell
37_Lym_DifBCell1 | CN_2_4HDLNA2R | | NHL Diffuse Large B-Cell
38_Lym_DifBCell2 | CN_3_4M4S7AAM | | NHL Diffuse Large B-Cell
39_Lym_DifBCell3 | CN_5_HEODOAR2 | | NHL Diffuse Large B-Cell
40_NonHod_Lym1 | 77332A1(5 tubes) | Colon | NHL Diffuse Large B-Cell
41_MalLym4 | 76161A1 | Spleen | NHL Diffuse Large B-Cell
42_Lym_MantleCell1 | CN_6_MAE47AOY | | NHL Mantle Cell
43_Lym_MantleCell2 | CN_7_VJU9OAO9 | | NHL Mantle Cell
44_NonHod_Lym2 | 95377A1(5 tubes) | Spleen | NHL
45_THP_1 | THP-1 | monocytes | AML cell line
46_KG_1 | KG-1 | myeloblast | AML cell line
47_BDCM | BDCM | B and DC like | AML cell line
48_CESS | CESS | lymphoblasts | AML cell line
49_HL60 | HL60 | myeloblast | AML cell line
50_K562 | K562 | lymphoblasts | CML cell line
51_Jurkat | Jurkat | T lymphoblasts | T ALL cell line
52_GA10 | GA10 | B lymphoblasts | Burkitts lymphoma cell line
53_RAMOS | RAMOS | B lymphoblasts | Burkitts lymphoma cell line
54_RAJI | RAJI | B lymphoblasts | Burkitts lymphoma cell line
55_Daudi | Daudi | B lymphoblasts | Burkitts lymphoma cell line
56_NL564 | NL564 | B lymphoblasts | EBV transformed cell line
57_NL553 | NL553 | B lymphoblasts | EBV transformed cell line
58_SKW6.4 | SKW6.4 | B cells lymphoblasts | EBV transformed cell line
59_NCI_H929 | NCI-H929 | B lymphoblasts | Multiple Myeloma cell line
60_MC/CAR | MC/CAR | B lymphoblasts | Multiple Myeloma cell line
61_U266 | U266 | B lymphoblasts | Multiple Myeloma cell line
62_RPMI8226 | RPMI8226 | B lymphoblasts | Multiple Myeloma cell line
63_IM_9 | IM-9 | B lymphoblasts | Multiple Myeloma cell line
64_cereN | cerebellum normal | cerebellum normal |
65_kidneyN1 | kidney normal | kidney normal |
66_kidneyN2 | kidney normal | kidney normal |
67_KidneyN3 | kidney normal | kidney normal |
68_colonN1 | colon normal | colon normal |
69_colonN2 | colon normal | colon normal |
70_stomN | stomach normal | stomach normal |
71_liverN | liver normal | liver normal |
72_lungN1 | lung normal | lung normal |
73_lungN2 | lung normal | lung normal |
74_small intestineN | small intestine | small intestine |
75_brainN | brain normal mix | brain normal mix |
76_heartN | heart normal mix | heart normal mix |

TABLE 2 Tissue samples in normal panel: Sample Tissue id id(GCI)/case (GCI)/Specimen Sample id id (Asterand) id (Asterand)/RNA id sample name Source Lot no. (Asternd) (GCI) 1-(7)-Bc-Rectum Biochain A610297 2-(8)-Bc-Rectum Biochain A610298 3-GC-Colon GCI CDSUV CDSUVNR3 4-As-Colon Asterand 16364 31802 31802B1 5-As-Colon Asterand 22900 74446 74446B1 6-GC-Small bowel GCI V9L7D V9L7DN6Z 7-GC-Small bowel GCI M3GVT M3GVTN5R 8-GC-Small bowel GCI 196S2 196S2AJN 9-(9)-Am-Stomach Ambion 110P04A 10-(10)-Bc-Stomach Biochain A501159 11-(11)-Bc-Esoph Biochain A603814 12-(12)-Bc-Esoph Biochain A603813 13-As-Panc Asterand 8918 9442 9442C1 14-As-Panc Asterand 10082 11134 11134B1 16-As-Liver Asterand 7916 7203 7203B1 17-(28)-Am-Bladder Ambion 071P02C 18-(29)-Bc-Bladder Biochain A504088 19-(64)-Am-Kidney Ambion 111P0101B 20-(65)-Cl-Kidney Clontech 1110970 21-(66)-Bc-Kidney Biochain A411080 22-GC-Kidney GCI N1EVZ N1EVZN91 23-GC-Kidney GCI BMI6W BMI6WN9F 25-(43)-Bc-Adrenal Biochain A610374 26-(16)-Am-Lung Ambion 111P0103A 28-As-Lung Asterand 9078 9275 9275B1 29-As-Lung Asterand 6692 6161 6161A1 30-As-Lung Asterand 7900 7180 7180F1 31-(75)-GC-Ovary GCI L629FRV1 32-(76)-GC-Ovary GCI DWHTZRQX 33-(77)-GC-Ovary GCI FDPL9NJ6 34-(78)-GC-Ovary GCI GWXUZN5M 36-GC-cervix GCI E2P2N E2P2NAP4 38-(26)-Bc-Uterus Biochain A504090 39-(30)-Am-Placen Ambion 021P33A 40-(32)-Bc-Placen Biochain A411073 41-GC-Breast GCI DHLR1 42-GC-Breast GCI TG6J6 43-GC-Breast GCI E6UDD E6UDDNCF 44-(38)-Am-Prostate Ambion 25955 45-Bc-Prostate Biochain A609258 46-As-Testis Asterand 13071 19567 19567B1 47-As-Testis Asterand 19671 42120 42120A1 49-GC-Artery GCI YGTVY YGTVYAIN 50-TH-Blood-PBMC Tel- 52497 Hashomer 51-TH-Blood-PBMC Tel- 31055 Hashomer 52-TH-Blood-PBMC Tel- 31058 Hashomer 53-(54)-Ic-Spleen Ichilov CG-267 54-(55)-Am-Spleen Ambion 111P0106B 54-(55)-Am- Ambion Spleen 56-(58)-Am-Thymus Ambion 101P0101A 57-(60)-Bc-Thyroid Biochain A610287 58-(62)-Ic-Thyroid Ichilov CG-119-2 59-Gc-Sali gland GCI NNSMV NNSMVNJC 
60-(67)-Ic-Cerebellum Ichilov CG-183-5 61-(68)-Ic-Cerebellum Ichilov CG-212-5 62-(69)-Bc-Brain Biochain A411322 63-(71)-Bc-Brain Biochain A411079 64-(72)-Ic-Brain Ichilov CG-151-1 65-(44)-Bc-Heart Biochain A411077 66-(46)-Ic-Heart Ichilov CG-227-1 67-(45)-Ic-Heart Ichilov CG-255-9 (Fibrotic) 68-GC-Skel Mus GCI T8YZS T8YZSN7O 69-GC-Skel Mus GCI Q3WKA Q3WKANCJ 70-As-Skel Mus Asterand 8774 8235 8235G1 71-As-Skel Mus Asterand 8775 8244 8244A1 72-As-Skel Mus Asterand 10937 12648 12648C1 73-As-Skel Mus Asterand 6692 6166 6166A1

TABLE 3 Lung cancer testing panel sample id (GCI)/ TISSUE RNA case id ID ID (Asterand)/ (GCI)/ (GCI)/ Spec- Source/ lot no. specimen Sample Diag imen Tis- Deliv- sample (old ID ID re- loca- Tum sue ery name samples) (Asterand) (Asterand) Diag marks tion Gr TNM CS % Gen LC GCI 1-GC- 7Z9V4 7Z9V4AYM Aden BC IA 80 F BAC-SIA LC GCI 2-GC- ZW2AQ ZW2AQARP Aden BC IB 70 F BAC-SIB LC Bioch 72-(44)- A501123 AC 2 UN F Bc-BAC LC GCI 4-GC- 3MOPL 3MOPLA79 Aden IA 60 M Adeno- SIA LC GCI 5-GC- KOJXD KOJXDAV4 Aden IA 90 F Adeno- SIA LC GCI 6-GC- X2Q44 X2Q44A79 Aden IA 85 M Adeno- SIA LC GCI 7-GC- 6BACZ 6BACZAP5 Aden IA 60 F Adeno- SIA LC GCI 8-GC- BS9AF BS9AFA3E Aden IA 55 F Adeno- SIA LC GCI 9-GC- UCLOA UCLOAA9L Aden IA 80 F Adeno- SIA LC GCI 10-GC- BVYK3 BVYK3A7Z Aden IA 60 F Adeno- SIA LC GCI 11-GC- U4DM4 U4DM4AFZ Aden IB 65 F Adeno- SIB LC GCI 12-GC- OWX5Y OWX5YA3S Aden IB 90 M Adeno- SIB LC GCI 13-GC- XYY96 XYY96A6B Aden IIA 70 F Adeno- SIIA LC GCI 14-GC- SO7B1 SO7B1AIJ Aden IIA 70 M Adeno- SIIA LC GCI 15-GC- QANSY QANSYACD Aden IIIA 65 F Adeno- SIIIA LC Bioch 16-(95)- A610063 Aden 1 UN F BC-Adeno LC Bioch 17-(89)- A609077 Aden 2-3 UN M Bc-Adeno LC Bioch 18-(76)- A609218 Aden 3 UN M Bc-Adeno LC Bioch 74-(2)- A504118 Aden 1 UN M Bc-Adeno LC Bioch 75-(77)- A608301 Aden 2 UN M Bc-Adeno LC Bioch 76-(75)- A609217 Aden 2 UN M Bc-Adeno LC Bioch 78-(13)- A504116 Aden 2-3 UN M Bc-Adeno LC Ichilov 81-(14)- CG-111 Aden UN UN M Ic-Adeno LC Aster 19-As- 9220 9418 9418A1 SCC 1 TXN0M0 Occult 80 M Sq-S0 LC GCI 20-GC- U2QHS U2QHSA2N SCC IA 55 F Sq-SIA LC GCI 21-GC- TRQR7 TRQR7ACD SCC IB 75 M Sq-SIB LC Aster 22-As- 17581 32603 32603B1 SCC 3 T2N0M0 IB 90 M Sq-SIB LC Aster 23-As- 18309 41454 41454B1 SCC 2 T2N0MX IB 100 M Sq-SIB LC Aster 24-As- 9217 9415 9415B1 SCC 2 T2N0M0 IB 90 M Sq-SIB LC GCI 25-GC- RXQ1P RXQ1PAEA SCC IIB 55 F Sq-SIIB LC GCI 26-GC- KB5KH KB5KHA6X SCC IIB 65 M Sq-SIIB LC GCI 27-GC- LAYMB LAYMBALF SCC IIIA 65 F Sq-SIIIA LC Ichilov 29-(25)- CG-204 SCC UN UN M Ic-Sq LC Bioch 
30-(19)- A408175 SCC 1 UN M Bc-Sq LC Bioch 31-(78)- A607125 SCC 2 UN M Bc-Sq LC Bioch 32-(16)- A409091 SCC 2 UN F Bc-Sq LC Bioch 33-(80)- A609163 SCC 2 UN M Bc-Sq LC Bioch 34-(18)- A503387 SCC 2-3 UN M Bc-Sq LC Bioch 35-(81)- A609076 SCC 3 UN M Bc-Sq LC Bioch 82-(21)- A503187 SCC 2 UN M Bc-Sq LC Bioch 83-(17)- A503183 SCC 2 UN M Bc-Sq LC Bioch 84-(79)- A609018 SCC 3 UN M Bc-Sq LC Bioch 85-(22)- A503386 SCC UN UN M Bc-Sq LC Bioch 86-(20)- A501121 SCC UN UN M Bc-Sq LC Bioch 87-(88)- A609219 SCC UN UN M Bc-Sq LC Bioch 88-(100)- A409017 SCC UN UN M Bc-Sq LC Ichilov 89-(24)- CG-123 SCC UN UN M Ic-Sq LC GCI 36-GC- AF8AL AF8ALAAL LCC IA 85 M LCC-SIA LC GCI 37-GC- O62XU O62XUA1X LCC IB 75 F LCC-SIB LC GCI 38-GC- OLOIM OLOIMAS1 LCC IB 70 M LCC-SIB LC GCI 39-GC- 1ZWSV 1ZWSVAB9 LCC IIB 50 M LCC-SIIB LC GCI 40-GC- 2YHOD 2YHODA1H LCC NSCC . . . IIB 95 M LCC-SIIB LC GCI 41-GC- 38B4D 38B4DAQK LCC IIB 90 F LCC-SIIB LC Bioch 90-(39)- A504114 LCC UN UN F Bc-LCC LC Bioch 91-(87)- A609165 LCC 3 UN F Bc-LCC LC Bioch 92-(38)- A504113 LCC UN UN M Bc-LCC LC Bioch 93-(82)- A609170 LCNC UN UN M Bc-LCC LC GCI 42-GC- QPJQL QPJQLAF6 SMCC NC 3 IB 65 F SCC-SIB LC Bioch 43-(32)- A501391 SMCC UN M Bc-SCC LC Bioch 44-(30)- A501389 SMCC 3 UN M Bc-SCC LC Bioch 45-(83)- A609162 SMCC UN UN F Bc-SCC LC Bioch 46-(86)- A608032 SMCC 3 UN F Bc-SCC LC Bioch 47-(31)- A501390 SMCC UN F Bc-SCC LC Bioch 48-(84)- A609167 SMCC UN UN F Bc-SCC LC Bioch 49-(85)- A609169 SMCC UN UN M Bc-SCC LC Bioch 50-(33)- A504115 SMCC UN M Bc-SCC LN Aster 51-As- 9078 9275 9275B1 Norm- PS M N-PS L LN Aster 52-As- 8757 8100 8100B1 Norm- PM (Right), F N-PM L Lobe Infe- rior LN Aster 53-As- 6692 6161 6161A1 Norm- PM M N-PM L LN Aster 54-As- 7900 7180 7180F1 Norm- PM F N-PM L LN Aster 55-As- 8771 8163 8163A1 Norm- PM (Left), M N-PM L Lobe Supe- rior LN Aster 56-As- 13094 19763 19763A1 Norm- PM M N-PM L LN Aster 57-As- 19174 40654 40654A2 Norm- PM F N-PM L LN Aster 58-As- 13128 19642 19642A1 Norm- PM F N-PM L LN Aster 59-As- 14374 20548 
20548C1 Norm- PM (Right), F N-PM L Lobe Supe- rior LN Amb 60-(99)- 36856 N-PM PM M Am-NPM LN Amb 61-(96)- 36853 N-PM PM F Am-NPM LN Amb 62-(97)- 36854 N-PM PM M Am-NPM LN Amb 63-(93)- 111P0103A N-PM PM- F Am-NPM ICH LN Amb 64-(98)- 36855 N-PM PM F Am-NPM LN Bioch 69-(91)- A607257 N-P2- PM P2 Bc-NPM PM LN Bioch 70-(90)- A608152 N-P2PM PM P2 Bc-NPM Source/ # Cig. # of Y. # Y. Recov- Cause Tis- Deliv- sample Ethnic Smoking Per Use of off Sm Sm Dr # ery of Exc. sue ery name age B Status day Tobacco Tobacco P Y? ppl Al Dr Type Death Y. LC GCI 1-GC- 63 WCAU Prev 20 15 27 N . Y 0 Surg 2001 BAC-SIA U. LC GCI 2-GC- 56 WCAU Prev 15 28 10 Y 1 Y 6 Surg 2002 BAC-SIB U. LC Bioch 72-(44)- 61 Bc-BAC LC GCI 4-GC- 68 WCAU Nev . . . N . N . Surg 2001 Adeno- U. SIA LC GCI 5-GC- 64 WCAU Prev 15 40 7 Y 1 N 0 Surg 2003 Adeno- U. SIA LC GCI 6-GC- 58 WCAU Prev 10 47 0 Y 2 N . Surg 2004 Adeno- U. SIA LC GCI 7-GC- 65 WCAU Curr  6 30 . Y 1 N . Surg 2004 Adeno- U. SIA LC GCI 8-GC- 59 WCAU Curr 20 40 . N . N . Surg 2004 Adeno- U. SIA LC GCI 9-GC- 69 WCAU Curr 30 52 . Y 4 N . Surg 2005 Adeno- U. SIA LC GCI 10-GC- 60 WCAU Curr 40 40 . N . N . Surg 2002 Adeno- U. SIA LC GCI 11-GC- 68 WCAU Prev  5  4 43 N . N . Surg 2003 Adeno- U. SIB LC GCI 12-GC- 69 WCAU Curr 10 . . . N . Surg 2002 Adeno- U. SIB LC GCI 13-GC- 62 WCAU Prev  6 40 6 N . Y 0 Surg 2004 Adeno- U. SIIA LC GCI 14-GC- 56 WCAU Curr 30 25 . Y 1 N . Surg 2001 Adeno- U. SIIA LC GCI 15-GC- 61 WCAU Curr 30 36 . Y 1 N . Surg 2004 Adeno- U. SIIIA LC Bioch 16-(95)- 54 BC-Adeno LC Bioch 17-(89)- 62 Bc-Adeno LC Bioch 18-(76)- 57 Bc-Adeno LC Bioch 74-(2)- 64 Bc-Adeno LC Bioch 75-(77)- 44 Bc-Adeno LC Bioch 76-(75)- 65 Bc-Adeno LC Bioch 78-(13)- 64 Bc-Adeno LC Ichilov 81-(14)- 68 Ic-Adeno LC Aster 19-As- 67 CAU Curr 11-20 31-40 O Surg 2003 Sq-S0 U. LC GCI 20-GC- 68 WCAU Prev 10 20 0 N . N . Surg 2004 Sq-SIA U. LC GCI 21-GC- 62 WCAU Prev 20 50 0 Y 5 N . Surg 2005 Sq-SIB U. LC Aster 22-As- 73 CAU Prev O Surg 2004 Sq-SIB U. 
LC Aster 23-As- 66 CAU Prev 11-20 45 P Surg 2005 Sq-SIB U. LC Aster 24-As- 65 CAU Curr  6-10 41-50 O Surg 2002 Sq-SIB U. LC GCI 25-GC- 44 WCAU Prev 20 20 0 Y 2 N . Surg 2004 Sq-SIIB U. LC GCI 26-GC- 68 WCAU Prev 40 40 0 Y 2 N . Surg 2004 Sq-SIIB U. LC GCI 27-GC- 58 WCAU Prev 50 40 1 Y 2 N . Surg 2004 Sq-SIIIA U. LC Ichilov 29-(25)- 72 Ic-Sq LC Bioch 30-(19)- 78 Bc-Sq LC Bioch 31-(78)- 62 Bc-Sq LC Bioch 32-(16)- 68 Bc-Sq LC Bioch 33-(80)- 74 Bc-Sq LC Bioch 34-(18)- 63 Bc-Sq LC Bioch 35-(81)- 53 Bc-Sq LC Bioch 82-(21)- 52 Bc-Sq LC Bioch 83-(17)- 57 Bc-Sq LC Bioch 84-(79)- 67 Bc-Sq LC Bioch 85-(22)- 48 Bc-Sq LC Bioch 86-(20)- 64 Bc-Sq LC Bioch 87-(88)- 64 Bc-Sq LC Bioch 88-(100)- 64 Bc-Sq LC Ichilov 89-(24)- 76 Ic-Sq LC GCI 36-GC- 45 WCAU Prev 45 33 0 Y 2 Y 28  Surg 2004 LCC-SIA U. LC GCI 37-GC- 60 WCAU Prev 30 45 0 Y 3 N . Surg 2004 LCC-SIB U. LC GCI 38-GC- 68 WCAU Prev . 55 . Y . N . Surg 2001 LCC-SIB U. LC GCI 39-GC- 51 WCAU Prev 20 12 22 Y 1 N . Surg 2004 LCC-SIIB U. LC GCI 40-GC- 62 WCAU Prev 40 40 0 Y 2 Y 12  Surg 2004 LCC-SIIB U. LC GCI 41-GC- 70 WCAU Prev 30 50 . Y 2 Y 13  Surg 2002 LCC-SIIB U. LC Bioch 90-(39)- 35 Bc-LCC LC Bioch 91-(87)- 47 Bc-LCC LC Bioch 92-(38)- 58 Bc-LCC LC Bioch 93-(82)- 68 Bc-LCC LC GCI 42-GC- 62 WCAU Prev 20 35 0.15 Y 2 N . Surg 2003 SCC-SIB U. LC Bioch 43-(32)- 30 Bc-SCC LC Bioch 44-(30)- 34 Bc-SCC LC Bioch 45-(83)- 47 Bc-SCC LC Bioch 46-(86)- 52 Bc-SCC LC Bioch 47-(31)- 59 Bc-SCC LC Bioch 48-(84)- 59 Bc-SCC LC Bioch 49-(85)- 66 Bc-SCC LC Bioch 50-(33)- Bc-SCC LN Aster 51-As- 22 CAU Nev NU Surg 2003 N-PS U. LN Aster 52-As- 26 CAU Nev O Aut CA 2003 N-PM U. LN Aster 53-As- 37 CAU Nev C Aut MCE 2002 N-PM U. LN Aster 54-As- 76 CAU Prev Aut CPul 2002 N-PM U. A LN Aster 55-As- 81 CAU Prev 41 or 31-40 O Aut CA 2003 N-PM U. more LN Aster 56-As-  0 CAU Prev 21-40 41-50 P Aut IC N-PM U. LN Aster 57-As- 69 CAU Curr 21-40 31-40 P Aut CPul 2005 N-PM U. 
A LN Aster 58-As- 75 CAU Aut CPul 2004 N-PM A LN Aster 59-As- 75 CAU Aut Cer 2004 N-PM A LN Amb 60-(99)- 31 Am-NPM LN Amb 61-(96)- 43 Am-NPM LN Amb 62-(97)- 46 Am-NPM LN Amb 63-(93)- 61 Am-NPM LN Amb 64-(98)- 72 Am-NPM LN Bioch 69-(91)- 24, 29 Bc-NPM LN Bioch 70-(90)- 27, 28 Bc-NPM

TABLE 3_1

Key | Full Name
# Cig. Per day | Number of Cigarettes per day
# Dr | Number of Drinks
# of Y. Use of Tobacco | Number of Years Using Tobacco
# Y. off Tobacco | Number of Years Off Tobacco
AC | Alveolus carcinoma
Aden | ADENOCARCINOMA
Amb | Ambion
Aster | Asterand
Aut | Autopsy
BC | BRONCHIOLOALVEOLAR CARCINOMA
Bioch | Biochain
C | Current Use
CA | Cardiac arrest
CAU | Caucasian
Cer A | Cerebrovascular accident
CPul A | Cardiopulmonary arrest
CS | Cancer Stage
Curr U. | Current Use
Diag | Diagnosis
Dr Al | Drink Alcohol?
Exc Y. | Excision Year
Gen | Gender
Gr | Grade
Height | HT
IC | Ischemic cardiomyopathy
LC | Lung Cancer
LCC | LARGE CELL CARCINOMA
LCNC | Large Cell Neuroendocrine Carcinoma
LN | Lung Normal
MCE | Massive cerebral edema
N | No
NC | NEUROENDOCRINE CARCINOMA
Nev. U. | Never Used
Norm-L | Normal Lung
N-P2-PM | Normal (Pool 2)-PM
N-PM | Normal-PM
NSCC... | NON-SMALL CELL CARCINOMA WITH SARCOMATOUS TRANSFORMATION
NU | Never Used
O | Occasional Use
P | Previous Use
P2 | Pool 2
Prev U. | Previous Use
SCC | Squamous Cell Carcinoma
Sm P Y? | Have people at home smoked in past 15 yr
Sm ppl | If yes, how many?
SMCC | SMALL CELL CARCINOMA
SMOKE_GROWN G_UP | Did people smoke at home while growing up
Surg | Surgical
Tum % | Tumor Percentage
WCAU | White Caucasian
Y | Yes

TABLE 4 Tissue samples in ovary panel sample_id (GCI)/ case id RNA ID Source/ (Asterand)/ (GCI)/ CA Meno- Tis- Deliv- sample lot no. (old Sample ID C Tumor Ethnic 125 pausal Mens sue ery name samples) (Asterand) Diag Stage % age BG PRE Status Age OVC Asterand 1-As- 23074 71900A2 SA I 80 49 CAU Pre-M Ser SI OVC Asterand 2-As- 22653 70270A1 SA I 90 69 WCAU Post-M Ser SI OVC Asterand 3-As- 18700 40771B1 SA IB 100 62 WCAU Post-M Ser SIB OVC GOG 79-(32)- 93-09-4901 SPC 1B 67 GO-Ser SIB OVC Asterand 4-As- 17646 32667B1 SA IB 100 68 W Post-M Ser SIB OVC Asterand 5- As- 15644 22996A1 SA IC 100 48 CAU M Ser SIC OVC Asterand 6-As- 18701 40773C1 SA IIA 100 59 CAU Post-M Ser SIIA OVC GCI 7-GC- 2O37O SA IIB 75 43 WCAU . Pre-M 12 Ser SIIB OVC GCI 8-GC- 7B3DP SA IIB 70 70 WCAU . Post-M 14 Ser SIIB OVC GOG 80-(30)- 2001-08-G011 PSC 3A 72 GO-Ser SIIIA OVC GOG 81-(70)- 95-08-G069 PSA 3B 50 GO-Ser SIIIB OVC GOG 82-(5)- 99-12-G432 A 3C 46 >500 GO-Ser SIIIC OVC Asterand 9-As- 13268 19832A1 SA IIIC 90 48 C Post-M Ser SIIIC OVC GOG 83-(29)- 2001-12-G035 SA 3C 50 260 GO-Ser SIIIC OVC GCI 10-GC- 3NTIS SA IIIC 70 53 WCAU 70 Post-M 12 Ser SIIIC OVC GCI 11-GC- CEJUS SA IIIC 70 53 WCAU 4814 Pre-M . Ser SIIIC OVC GCI 12-GC- 5NCLK SA IIIC 70 54 WCAU 209 Post-M 13 Ser SIIIC OVC ABS 84-(25)- N0021 PSA 3C 55 CAU AB-Ser SIIIC OVC GCI 13-GC- 1HI5H SA IIIC 90 61 WCAU 34 Post-M 12 Ser SIIIC OVC GCI 14-GC- 7RMHZ SA IIIC 80 63 WCAU . Post-M 12 Ser SIIIC OVC GCI 15-GC- 4WAAB SA IIIC 90 63 WCAU . Post-M 11 Ser SIIIC OVC GCI 16-GC- 79Z67 SA IIIC 85 67 WCAU . Post-M 12 Ser SIIIC OVC GOG 85-(13)- 94-05-7603 APP 3C 67 GO-Ser SIIIC OVC GCI 17-GC- DDSNL SA IIIC 70 68 WCAU . Post-M 11 Ser SIIIC OVC GCI 18-GC- DH8PH SA IV 95 70 WCAU . 
Post-M 13 Ser SIV OVC BioChain 86-(33)- A503175 SPC 41 Asian BC-Ser OVC BioChain 87-(14)- A501111 A 41 Asian Bc-Ser OVC Biochain 88-(12)- A406023 A 45 Asian Bc-Ser OVC Biochain 89-(11)- A407068 A 49 Asian Bc-Ser OVC ABS 90-(4)- ILS-7286 PC UN 50 Asian AB-Ser OVC ABS 91-(6)- A0106 A UN 51 Asian AB-Ser OVC ABS 92-(3)- ILS-1431 PA UN 52 Asian AB-Ser OVC BioChain 93-(31)- A503176 SPC 52 Asian Bc-Ser OVC ABS 94-(2)- ILS-1408 PA UN 53 Asian AB-Ser OVC ABS 95-(7)- IND-00375 A 59 Asian AB-Ser OVC BioChain 96-(8)- A501113 A 60 Asian Bc-Ser OVC Biochain 97-(10)- A407069 A 60 Asian Bc-Ser OVC ABS 98-(1)- ILS-1406 PA UN 73 Asian AB-Ser OVC GCI 19-GC- E2WKF EA IA 70 30 WCAU . Pre-M 12 Endo SIA OVC GCI 20-GC- 5895C EA IA 95 39 WCAU . Pre-M 14 Endo SIA OVC GCI 21-GC- 533DX EA IA 95 50 WCAU 190 Pre-M 11 Endo SIA OVC GCI 22-GC- HZ2EY EA IA 90 55 WCAU 1078 Pre-M 13 Endo SIA OVC GCI 23-GC- RWOIV EA IA 65 47 WCAU 1695 Pre-M 14 Endo SIA OVC GCI 24-GC- 1U52X EA IIA 95 61 WCAU 275 . Endo SIIA OVC GCI 25-GC- A17WS EA IIB 70 67 WCAU 78 Post-M 14 Endo SIIB OVC GCI 26-GC- 1VT3I EA IIIC 90 50 WCAU . Pre-M 12 Endo SIIIC OVC GCI 27-GC- PZQXH EA IIIC 80 52 WCAU . Pre-M 11 Endo SIIIC OVC GCI 28-GC- I8VHZ EA IV 90 68 WCAU . Post-M . Endo SIV OVC GOG 99-(41)- 98-03-G803 Mixed . . . 
2 38 >35 GO-Ser Mix SII OVC GOG 100-(40)- 95-11-G006 PS & EC 3C 49 GO-Ser Mix SIIIC OVC GOG 101-(37)- 2002-05-G513 MS & EA 3C 56 GO-Ser Mix SIIIC OVC GOG 102-(38)- 2002-05-G509 MS & EAM 3C 64 GO-Ser Mix SIIIC OVC GOG 103-(34)- 95-04-2002 PEA 3C 68 GO-Ser Mix SIIIC OVC GOG 29-(21)- 95-10-G020 MC IA 44 >100 GO-Muc SIA OVC GCI 30-GC- IMDA1 MA IC 70 41 WCAU 50 Pre-M 12 Muc SIC OVC Asterand 31-As- 12742 18920A1 MA IC 70 61 C Post-M Muc SIC OVC ABS 32-(22)- A0139 MC IC 72 Asian AB-Muc SIC OVC GCI 33-GC- NJM4U MA IIA 80 51 WCAU Muc SIIA OVC ABS 34-(20)- USA-00273 PMC IIIA 45 C AB-Muc SIIIA OVC GCI 35-GC- RAFCW MA IIIA 75 55 WCAU 95 Post-M 13 Muc SIIIA OVC Asterand 36-As- 23177 72888A1 MA IIIC 60 52 C Pre-M Muc SIIIC OVC Asterand 37-As- 16103 29374B1 MA IIIC 100 62 W Post-M Muc SIIIC OVC BioChain 104-(19)- A504085 MA 34 Asian Bc-Muc OVC BioChain 105-(18)- A504083 MA 45 Asian Bc-Muc OVC BioChain 106-(17)- A504084 MA 51 Asian Bc-Muc OVC BioChain 107-(15)- A407065 C 27 Asian Bc-Car OVC Clontech 108-(16)- 1090387 CNOS 58 Asian Cl-Car OVC GOG 109-(44)- 2001-07-G084 CCA 1A 73 GO-Clear cell SIA OVC GOG 110-(43)- 2001-10-G002 CCA 3A 74 slightly GO-Clear elevated cell SIIIA OVC_BT GCI 38-GC- SC656 MBT IA 75 40 WCAU 138 Pre-M 13 Muc Border SIA OVC_BT GCI 39-GC- 3D5FO MBT IA 85 51 WCAU 19 ? 
15 Muc Border SIA OVC_BT GCI 40-GC- 7JP3F MBT IA 75 56 WCAU 125 Post-M 14 Muc Border SIA OVC_BT ABS 111-(23)- VNM-00187 MC Low M 45 Asian AB-Border OVC_BT GOG 112-(42)- 98-08-G001 EA of BM 1A 46 GO-Border SIA OVC_B GOG 41-(62)- 99-10-G442 BMC 32 6 Go-Ben Muc OVC_B GCI 43-GC- QLIKY BMC 100 42 WCAU Ben Muc OVC_B Asterand 44-As- 16870 30534A1 BMC 100 45 W Pre-M Ben Muc OVC_B GOG 45-(56)- 99-01-G407 BMC 46 GO-Ben Muc OVC_B GCI 46-GC- 943EC BMC 75 54 WCAU Ben Muc OVC_B GCI 47-GC- JO8W7 BMC 50 56 WCAU Ben Muc OVC_B Asterand 48-As- 17016 30645B1 BSC IA 100 38 C Pre-M Ben Ser OVC_B GOG 49-(64)- 99-06-G039 BSC 57 GO-Ben Ser OVC_B GCI 50-GC- DQQ2F BSCF 95 68 WCAU Ben Ser OVC_B Asterand 51-As- 8786 8275A1 BSC 100 80 CAU Post-M Ben Ser OVC_NBM Asterand 52-As- 15690 23054A1 NO-BM 52 CAU Pre-M NBM OVC_NBM Asterand 53-As- 16843 30488A1 NO-BM 57 W Post-M NBM OVC_NBM Asterand 54-As- 16850 30496B1 NO-BM 65 W Post-M NBM OVC_NBM Asterand 55-As- 16848 30499C1 NO-BM 66 CAU Post-M NBM OVC_N GCI 56-GC- WPU1U NO-PS 0 32 WC NPS OVC_N GCI 57-GC- Y9VHI NO-PS 0 35 WCAU NPS OVC_N GCI 58-GC- 76VM9 NO-PS 0 41 WCAU NPS OVC_N GCI 59-GC- DWHTZ NO-PS 0 42 WCAU NPS OVC_N GCI 60-GC- SJ2R2 NO-PS 0 43 WCAU NPS OVC_N GCI 61-GC- 9RQMN NO-PS 0 45 WCAU NPS OVC_N GCI 62-GC- TOAE5 NO-PS 0 45 WCAU NPS OVC_N GCI 63-GC- TW9PM NO-PS 0 46 WCAU NPS OVC_N GCI 64-GC- 2VND2 NO-PS 0 46 WCAU NPS OVC_N GCI 65-GC- L629F NO-PS 0 47 WCAU NPS OVC_N GCI 66-GC- XLB23 NO-PS 0 47 WCAU NPS OVC_N GCI 67-GC- IDUVY NO-PS 0 47 WCAU NPS OVC_N GCI 68-GC- ZCXAD NO-PS 0 48 WCAU NPS OVC_N GCI 69-GC- PEQ6C NO-PS 0 49 WCAU NPS OVC_N GCI 70-GC- DD73B NO-PS 0 49 WCAU NPS OVC_N GCI 71-GC- E2UF7 NO-PS 0 53 WCAU NPS OVC_N GCI 72-GC- GWXUZ NO-PS 0 53 WCAU NPS OVC_N GCI 73-GC- 4YG5P NO-PS 0 55 WCAU NPS OVC_N GCI 74-GC- FDPL9 NO-PS 0 56 WCAU NPS OVC_N BioChain 75-(45)- A503274 NO-PM 41 Asian Bc-NPM OVC_N BioChain 76-(46)- A504086 NO-PM 41 Asian Bc-NPM OVC_N Ichilov 77-(71)- CG-188-7 NO-PM 49 Ic-NPM OVC_N BioChain 78-(48)- A504087 NO-PM 51 Asian 
Bc-NPM Source/ Age at Oral Oral Tis- Deliv- sample Preg Preg first Con Con Tubal Recovery sue ery name Times Toterm child OC Length Unit ligation Type OVC Asterand 1-As- 2 1 Surg Ser SI OVC Asterand 2-As- 1 1 Surg Ser SI OVC Asterand 3-As- 3 3 Surg Ser SIB OVC GOG 79-(32)- GO-Ser SIB OVC Asterand 4-As- 9 2 Surg Ser SIB OVC Asterand 5- As- 4 2 Surg Ser SIC OVC Asterand 6-As- 1 1 Surg Ser SIIA OVC GCI 7-GC- 0 0  0 NO . NO Surg Ser SIIB OVC GCI 8-GC- 5 3 20 YES 6 months NO Surg Ser SIIB OVC GOG 80-(30)- GO-Ser SIIIA OVC GOG 81-(70)- GO-Ser SIIIB OVC GOG 82-(5)- GO-Ser SIIIC OVC Asterand 9-As- Surg Ser SIIIC OVC GOG 83-(29)- GO-Ser SIIIC OVC GCI 10-GC- 1 1 26 YES 3 months NO Surg Ser SIIIC OVC GCI 11-GC- 2 2 30 NO . NO Surg Ser SIIIC OVC GCI 12-GC- 2 2 21 YES 1 years NO Surg Ser SIIIC OVC ABS 84-(25)- AB-Ser SIIIC OVC GCI 13-GC- 6 3 22 NO . NO Surg Ser SIIIC OVC GCI 14-GC- 2 2 20 YES 10  years NO Surg Ser SIIIC OVC GCI 15-GC- 2 1 29 YES 4 years NO Surg Ser SIIIC OVC GCI 16-GC- 6 5 24 YES 2 years YES Surg Ser SIIIC OVC GOG 85-(13)- GO-Ser SIIIC OVC GCI 17-GC- 4 4 19 NO . NO Surg Ser SIIIC OVC GCI 18-GC- 4 3 20 NO . NO Surg Ser SIV OVC BioChain 86-(33)- BC-Ser OVC BioChain 87-(14)- Bc-Ser OVC Biochain 88-(12)- Bc-Ser OVC Biochain 89-(11)- Bc-Ser OVC ABS 90-(4)- AB-Ser OVC ABS 91-(6)- AB-Ser OVC ABS 92-(3)- AB-Ser OVC BioChain 93-(31)- Bc-Ser OVC ABS 94-(2)- AB-Ser OVC ABS 95-(7)- AB-Ser OVC BioChain 96-(8)- Bc-Ser OVC Biochain 97-(10)- Bc-Ser OVC ABS 98-(1)- AB-Ser OVC GCI 19-GC- 6 5 17 YES 6 years NO Surg Endo SIA OVC GCI 20-GC- 2 2 20 NO . NO Surg Endo SIA OVC GCI 21-GC- 0 . . YES 2 years NO Surg Endo SIA OVC GCI 22-GC- 0 . . NO . NO Surg Endo SIA OVC GCI 23-GC- 0 . . NO . NO Surg Endo SIA OVC GCI 24-GC- . . . . Surg Endo SIIA OVC GCI 25-GC- 0 . . NO . NO Surg Endo SIIB OVC GCI 26-GC- 2 2 24 YES 1 years NO Surg Endo SIIIC OVC GCI 27-GC- 0 . . YES 5 years NO Surg Endo SIIIC OVC GCI 28-GC- 2 2 27 NO . 
NO Surg Endo SIV OVC GOG 99-(41)- GO-Ser Mix SII OVC GOG 100-(40)- GO-Ser Mix SIIIC OVC GOG 101-(37)- GO-Ser Mix SIIIC OVC GOG 102-(38)- GO-Ser Mix SIIIC OVC GOG 103-(34)- GO-Ser Mix SIIIC OVC GOG 29-(21)- GO-Muc SIA OVC GCI 30-GC- 2 1 24 NO . Surg Muc SIC OVC Asterand 31-As- 3 3 Surg Muc SIC OVC ABS 32-(22)- AB-Muc SIC OVC GCI 33-GC- Surg Muc SIIA OVC ABS 34-(20)- AB-Muc SIIIA OVC GCI 35-GC- 4 3 22 NO . NO Surg Muc SIIIA OVC Asterand 36-As- Surg Muc SIIIC OVC Asterand 37-As- 1 1 Surg Muc SIIIC OVC BioChain 104-(19)- Bc-Muc OVC BioChain 105-(18)- Bc-Muc OVC BioChain 106-(17)- Bc-Muc OVC BioChain 107-(15)- Bc-Car OVC Clontech 108-(16)- Cl-Car OVC GOG 109-(44)- GO-Clear cell SIA OVC GOG 110-(43)- GO-Clear cell SIIIA OVC_BT GCI 38-GC- 2 2 23 NO . YES Surg Muc Border SIA OVC_BT GCI 39-GC- 0 . . NO . NO Surg Muc Border SIA OVC_BT GCI 40-GC- 3 3 19 YES 5 years NO Surg Muc Border SIA OVC_BT ABS 111-(23)- AB-Border OVC_BT GOG 112-(42)- GO-Border SIA OVC_B GOG 41-(62)- Go-Ben Muc OVC_B GCI 43-GC- Surg Ben Muc OVC_B Asterand 44-As- 2 2 Surg Ben Muc OVC_B GOG 45-(56)- GO-Ben Muc OVC_B GCI 46-GC- Surg Ben Muc OVC_B GCI 47-GC- Surg Ben Muc OVC_B Asterand 48-As- 2 2 Surg Ben Ser OVC_B GOG 49-(64)- GO-Ben Ser OVC_B GCI 50-GC- Surg Ben Ser OVC_B Asterand 51-As- 10  9 Surg Ben Ser OVC_NBM Asterand 52-As- 10  3 Surg NBM OVC_NBM Asterand 53-As- 4 2 Surg NBM OVC_NBM Asterand 54-As- 2 2 Surg NBM OVC_NBM Asterand 55-As- 9 2 Surg NBM OVC_N GCI 56-GC- Surg NPS OVC_N GCI 57-GC- Surg NPS OVC_N GCI 58-GC- Surg NPS OVC_N GCI 59-GC- Surg NPS OVC_N GCI 60-GC- Surg NPS OVC_N GCI 61-GC- Surg NPS OVC_N GCI 62-GC- Surg NPS OVC_N GCI 63-GC- Surg NPS OVC_N GCI 64-GC- Surg NPS OVC_N GCI 65-GC- Surg NPS OVC_N GCI 66-GC- Surg NPS OVC_N GCI 67-GC- Surg NPS OVC_N GCI 68-GC- Surg NPS OVC_N GCI 69-GC- Surg NPS OVC_N GCI 70-GC- Surg NPS OVC_N GCI 71-GC- Surg NPS OVC_N GCI 72-GC- Surg NPS OVC_N GCI 73-GC- Surg NPS OVC_N GCI 74-GC- Surg NPS OVC_N BioChain 75-(45)- Bc-NPM OVC_N BioChain 76-(46)- Bc-NPM OVC_N 
Ichilov 77-(71)- Ic-NPM OVC_N BioChain 78-(48)- Bc-NPM

TABLE 4_1
Key | Full Name
A | Adenocarcinoma
APP | Adenocarcinoma from primary peritoneal
BMC | BENIGN MUCINOUS CYSTADENOMA
BSC | BENIGN SEROUS CYSTADENOMA
BSCF | BENIGN SEROUS CYSTADENOFIBROMA
C | Carcinoma
C Stage | Cancer stage
CAU | Caucasian
CCA | Clear cell adenocarcinoma
CNOS | Carcinoma NOS
EA | ENDOMETRIOID ADENOCARCINOMA
EA of BM | Endometrioid adenocarcinoma of borderline malignancy
M | Menopausal
MA | MUCINOUS ADENOCARCINOMA
MBT | MUCINOUS BORDERLINE TUMOR
MC | Mucinous cystadenocarcinoma
MC Low M | Mucinous cystadenocarcinoma with low malignant
Mens. Age | Menstrual Age
Mixed . . . | Mixed epithelial cystadenocarcinoma with mucinous, endometrioid, squamous and papillary serous
MS & EA | Mixed serous and endometrioid adenocarcinoma
MS & EAM | Mixed serous and endometrioid adenocarcinoma of mullerian
NO-BM | NORMAL OVARY-BM
NO-PM | NORMAL OVARY-PM
NO-PS | NORMAL OVARY-PS
OC | Oral Contraceptive
OVC | Ovary Cancer
OVC_B | Ovary Benign
OVC_BT | Ovary Borderline Tumor
OVC_N | Ovary Normal
OVC_NBM | Ovary normal-benign matched BM
PA | Papillary adenocarcinoma
PC | Papillary cystadenocarcinoma
PEA | Papillary endometrioid adenocarcinoma
PMC | Papillary mucinous cystadenocarcinoma
Post-M | Post-menopausal
Pre-M | Pre-menopausal
PS & EC | Papillary serous and endometrioid cystadenocarcinoma
PSA | Papillary serous adenocarcinoma
PSC | Papillary serous carcinoma
SA | SEROUS ADENOCARCINOMA
SPC | Serous papillary cystadenocarcinoma
W | White
WCAU | WHITE/CAUCASIAN

TABLE 5 Colon cancer testing panel sample_id (GCI)/ TISSUE case id ID (Asterand)/ (GCI)/ Spec- Source/ lot no. specimen Sample Diag imen Tis- Deliv- sample (old ID ID re- loca- sue ery name samples (Asterand) (Asterand) Diag marks tion Gr TNM CS CS2 CC Asterand 1-As- 18036 31312 31312B1 Aden Cec 3 TXN0M0 0 AdenS0 CC GCI 2-GC- 4QDH8 4QDH8ADT Aden Dis I Duke AdenoSI C A CC Ichilov 3-(7)- CG-235 AI Rectum UN I Duke Ic- A AdenoSI CC GCI 4-GC- NTAI8 NTAI8AOU Aden Cec I Duke AdenoSI B1 CC GCI 5-GC- ARA7P ARA7PAQA Aden Ret, I Duke AdenoSI Low B1 Ant CC Ichilov 6-(20)- CG-249 UA 3 IIA Duke Ic- B2 AdenoSIIA CC GCI 7-GC- AFTS6 AFTS6AP6 Aden IIA Duke AdenoSIIA B2 CC GCI 8-GC- 5CYDK 5CYDKACS Aden IIA Duke AdenoSIIA B2 CC GCI 9-GC- XKSLS XKSLSAF7 Aden IIA Duke AdenoSIIA B2 CC GCI 10-GC- B4RU8 B4RU8A8Q Aden IIA Duke AdenoSIIA B2 CC GCI 11-GC- HB8EY HB8EYA8I Aden IIA Duke AdenoSIIA B2 CC Ichilov 12-(22)- CG-229C Aden 2 II Duke AdenoSII B CC GCI 13-GC- X8C7X X8C7XATL Aden IIA Duke AdenoSIIA B2 CC GCI 14-GC- HCP6K HCP6KA8Z Aden IIA Duke AdenoSIIA B2. 
CC GCI 15-GC- ZX4X7 ZX4X7AXA Aden IIA Duke AdenoSIIA B2 CC Asterand 16-As- 17915 31176 31176A1 Aden 2-3 T3N0M0 IIA Duke AdenoSIIA B2 CC Ichilov 17-(1)- CG-335 Aden Cec 2 IIA Duke Ic- B2 AdenoSIIA CC Asterand 19-As- 12772 18885 18885A1 Aden rectum 2 T3NXM0 IIA Duke AdenoSIIA B2 CC GCI 20-GC- JFYXP JFYXPAMP Aden IIA Duke AdenoSIIA B2 CC GCI 21-GC- OJXW9 OJXW9ASR Aden IIA Duke AdenoSIIA B2 CC Ichilov 22-(28)- CG-284 Aden sigma 2 IIA Duke Ic- B2 AdenoSIIA CC Ichilov 23-(10)- CG-311 Aden Sig 1-2 eIIA Duke Ic- Col B2 AdenoSIIA CC Ichilov 24-(14)- CG-222(2) WP Rectum III Duke Ic- Aden C AdenoSIII CC Ichilov 25-(23)- CG-282 MA sigma UN III Duke Ic- C AdenoSIII CC GCI 26-GC- OTPI7 OTPI7AWY Aden III Duke AdenoSIII C2 CC GCI 27-GC- IG9NK IG9NKAD3 MA III Duke AdenoSIII C2 CC GCI 28-GC- 53OM7 53OM7AGL Aden III Duke AdenoSIII C2 CC GCI 29-GC- BLUW6 BLUW6A6Y Aden III Duke AdenoSIII C2 CC GCI 30-GC- VZ6QA VZ6QAAFA Aden RECTUM III Duke AdenoSIII C2 CC Ichilov 31-(6)- CG-303(3) Aden 1-2 III Duke Ic- C2. AdenoSIII CC Ichilov 32-(2)- CG-307 Aden Cecum 2 III Duke Ic- C2. AdenoSIII CC Ichilov 33-(11)- CG-337 Aden 1-2 III Duke Ic- C2. AdenoSIII CC Asterand 34-As- 18462 40971 40971A1 TA Sig 2 TXN2M0 IIIC AdenoSIIIC Col CC Ichilov 35-(13)- CG-290 Aden Rect 2 IV Duke Ic- Col D. AdenoSIV CC GCI 36-GC- 7D7QV 7D7QVAE6 Aden IV Duke AdenoSIV D CC GCI 37-GC- 38U4V 38U4VAA4 Aden IV Duke AdenoSIV D CC Ichilov 38-(9)- CG-297 Aden Rectum 2 IV Duke Ic- D. 
AdenoSIV CC Ichilov 71-(16)- CG-278C Aden 2 UN Ic- Adeno CC Ichilov 72-(4)- CG-276 Carc 3 UN Ic- Adeno CC Ichilov 73-(17)- CG-163 Aden Rectum 2 UN Ic- Adeno CC Ichilov 74-(5)- CG-308 Aden Col 2 UN Ic- Sig Adeno CC Ichilov 75-(72)- CG-309 Aden 3 UN Ic- Adeno CC Ichilov 76-(18)- CG-22C Aden UN UN Ic- Adeno CC Ichilov 78-(21)- CG-18C Aden UN UN Ic- Adeno CC Ichilov 79-(24)- CG-12 Aden UN UN Ic- Adeno CC Ichilov 80-(25)- CG-2 Aden UN UN Ic- Adeno CC biochain 82-(61)- A606258 Aden, 2 UN Bc- Ulcer Adeno CC biochain 83-(57)- A609150 Aden 3 UN Bc- Adeno CC biochain 84-(56)- A609148 Aden 2 UN Bc- Adeno CC biochain 85-(53)- A609161 Aden 3 UN Bc- Adeno CC biochain 86-(54)- A609142 Aden 3 UN Bc- Adeno CC biochain 87-(59)- A609059 Aden, 1 UN Bc- Ulcer Adeno CC biochain 88-(60)- A609058 Aden, 2 UN Bc- Ulcer Adeno CC biochain 89-(55)- A609144 Aden 3 UN Bc- Adeno CC biochain 90-(58)- A609152 Aden 1 UN Bc- Adeno CB GCI 40-GC- IG3OY IG3OYN7S TS RT Ben Aden Col CB GCI 41-GC- GKIEY GKIEYAV4 TS Prox Ben Aden T Col HGD CN GCI 42-GC- AGVTG AGVTCNK7 NG DIV NPS CN Asterand 43-As- 8956 9153 9153B1 NG NPS CN GCI 44-GC- IG3OY IG3OYN7S NG RT NPS Col CN GCI 45-GC- K9OYX K9OYXN4F NG Divs w/ LT NPS FDIV Col CN Asterand 46-As- 23024 74445 74445B1 NG Chr NPS Divs CN Asterand 47-As- 23049 71410 71410B2 NG Chr NPS Divs CN GCI 48-GC- G7JJX G7JJXAX7 NG Divs w/ Sig NPS DIV . . . Col CN Asterand 49-As- 22900 74446 74446B1 NG AD w/AF NPS CN GCI 50-GC- XVPZ2 XVPZ2NDD NC Div NPS CN GCI 51-GC- CDSUV CDSUVNR3 NC CU NPS CN GCI 52- GP5KH GP5KHAOC NC Div GC-NPS CN GCI 53-GC- YUZNR YUZNRNDN NC Divs Sig NPS Col CN GCI 54-GC- 28QN6 28QN6NI1 NC TS RT NPS Aden Col CN GCI 55-GC- GV6N8 GV6N8NG9 NC Divs, NPS PA CN GCI 56-GC- ZJ17R ZJ17RNIH NC Tub RT NPS Aden Col CN GCI 57-GC- 2EEBJ 2EEBJN2Q NC Div/ NPS Chr Infl CN GCI 58-GC- 68IX5 68IX5N1H NC Chr LT NPS Div Col CN GCI 59-GC- 9GEGL 9GEGLN1V NC Ext Sig NPS Divs Col CN GCI 60-GC- PKU8O PKU8OAJ3 NG Divs, Sig NPS Chr Col Div . . . 
CN Asterand 61-As- 22903 74452 74452B1 NG MU w/ NPS MI CN Asterand 62-As- 16364 31802 31802B1 NG UC NPS CN biochain 63-(65)- A607115 N-PM PM Bc- NPM CN Ambion 64-(71)- 071P10B N-PM PM Am-NPM CN biochain 65-(66)- A609262 N-PM PM Bc-NPM CN biochain 66-(63)- A609260 N-PM PM Bc-NPM CN biochain 67-(62)- A608273 N-PM PM Bc- NPM CN biochain 68-(64)- A609261 N-PM PM Bc-NPM CN biochain 69-(41)- A501156 N-PM PM Bc-NPM CN biochain 70-(67)- A406029 + N-PMP10 PM Bc-NPM A411078 Source/ Alco- Dr. Recov- Tis- Deliv- sample Tumor Ethnic hol per Alc. ery Exc. sue ery name % Gender age B Status day Dur. Type Y. CC Asterand 1-As- 80 F 43 CAU NU Auto 2004 AdenS0 CC GCI 2-GC- 85 F 44 WCAU Y 4 Surg AdenoSI CC Ichilov 3-(7)- F 66 Ic- AdenoSI CC GCI 4-GC- 80 M 53 WCAU Y Surg AdenoSI CC GCI 5-GC- 70 F 70 WCAU Y 0 Surg AdenoSI CC Ichilov 6-(20)- M 36 Ic- AdenoSIIA CC GCI 7-GC- 75 M 39 WCAU N 0 Surg AdenoSIIA CC GCI 8-GC- 65 M 44 WCAU N Surg AdenoSIIA CC GCI 9-GC- 65 M 48 WCAU Y 10 Surg AdenoSIIA CC GCI 10-GC- 65 F 50 WCAU N Surg AdenoSIIA CC GCI 11-GC- 65 M 53 WCAU N Surg AdenoSIIA CC Ichilov 12-(22)- F 55 AdenoSII CC GCI 13-GC- 90 M 56 WCAU N Surg AdenoSIIA CC GCI 14-GC- 80 M 58 WCAU Y 4 Surg AdenoSIIA CC GCI 15-GC- 90 M 60 WCAU Y 5 Surg AdenoSIIA CC Asterand 16-As- 60 F 64 CAU occ 1 drink/ 21-30 Auto 2004 AdenoSIIA Week years CC Ichilov 17-(1)- F 66 Ic- AdenoSIIA CC Asterand 19-As- 60 F 67 CAU NU Surg 2004 AdenoSIIA CC GCI 20-GC- 60 F 68 WCAU Y Surg AdenoSIIA CC GCI 21-GC- 90 F 69 WCAU N Surg AdenoSIIA CC Ichilov 22-(28)- F 72 Ic- AdenoSIIA CC Ichilov 23-(10)- M 88 Ic- AdenoSIIA CC Ichilov 24-(14)- F 49 Ic- AdenoSIII CC Ichilov 25-(23)- M 51 Ic- AdenoSIII CC GCI 26-GC- 70 F 54 WCAU N Surg AdenoSIII CC GCI 27-GC- 90 F 54 WCAU N Surg AdenoSIII CC GCI 28-GC- 75 F 61 WCAU N Surg AdenoSIII CC GCI 29-GC- 85 F 64 WCAU N Surg AdenoSIII CC GCI 30-GC- 60 M 67 WCAU Y 14 Surg AdenoSIII CC Ichilov 31-(6)- F 77 Ic- AdenoSIII CC Ichilov 32-(2)- F 89 Ic- AdenoSIII CC Ichilov 33-(11)- NA NA Ic- AdenoSIII 
CC Asterand 34-As- 76 F 68 CAU NU Surg 2005 AdenoSIIIC CC Ichilov 35-(13)- M 47 Ic- AdenoSIV CC GCI 36-GC- 80 F 52 WCAU Y 3 Surg AdenoSIV CC GCI 37-GC- 85 F 53 WCAU Surg AdenoSIV CC Ichilov 38-(9)- M 62 Ic- AdenoSIV CC Ichilov 71-(16)- 50 F 60 Ic- Adeno CC Ichilov 72-(4)- 75 M 64 Ic- Adeno CC Ichilov 73-(17)- M 73 Ic- Adeno CC Ichilov 74-(5)- F 80 Ic- Adeno CC Ichilov 75-(72)- F 88 Ic- Adeno CC Ichilov 76-(18)- NA NA Ic- Adeno CC Ichilov 78-(21)- NA NA Ic- Adeno CC Ichilov 79-(24)- NA NA Ic- Adeno CC Ichilov 80-(25)- NA NA Ic- Adeno CC biochain 82-(61)- M 41 Bc- Adeno CC biochain 83-(57)- F 45 Bc- Adeno CC biochain 84-(56)- 40 F 48 Bc- Adeno CC biochain 85-(53)- F 53 Bc- Adeno CC biochain 86-(54)- M 53 Bc- Adeno CC biochain 87-(59)- M 58 Bc- Adeno CC biochain 88-(60)- M 67 Bc- Adeno CC biochain 89-(55)- M 68 Bc- Adeno CC biochain 90-(58)- M 73 Bc- Adeno CB GCI 40-GC- F 48 WCAU Y 1 Surg Ben CB GCI 41-GC- F 75 WCAU N Surg Ben CN GCI 42-GC- 0 M 45 WCAU N Surg NPS CN Asterand 43-As- 0 F 46 CAU NU Surg 2002 NPS CN GCI 44-GC- 0 F 48 WCAU Y 1 Surg NPS CN GCI 45-GC- 0 F 50 WCAU N Surg NPS CN Asterand 46-As- 0 F 52 CAU Occ Surg 2005 NPS CN Asterand 47-As- 0 F 52 CAU occ Surg 2005 NPS CN GCI 48-GC- 0 M 52 WCAU N Surg NPS CN Asterand 49-As- 0 M 54 CAU CurU Surg 2005 NPS CN GCI 50-GC- 0 F 55 WCAU N Surg NPS CN GCI 51-GC- 0 M 55 WCAU N Surg NPS CN GCI 52- 0 F 57 WCAU Y 6 Surg GC-NPS CN GCI 53-GC- 0 F 57 WCAU Y 1 Surg NPS CN GCI 54-GC- 0 M 59 WCAU Y 42 Surg NPS CN GCI 55-GC- 0 F 61 WCAU Y 3 Surg NPS CN GCI 56-GC- 0 M 61 WCAU Y Surg NPS CN GCI 57-GC- 0 F 66 WCAU Y 4 Surg NPS CN GCI 58-GC- 0 F 66 WCAU N Surg NPS CN GCI 59-GC- 0 M 68 WCAU N Surg NPS CN GCI 60-GC- 0 F 69 WCAU N Surg NPS CN Asterand 61-As- 0 M 71 CAU Occ Surg 2005 NPS CN Asterand 62-As- 0 F 74 WCAU Occ Surg 2004 NPS CN biochain 63-(65)- M 24 Bc- NPM CN Ambion 64-(71)- F 34 Am-NPM CN biochain 65-(66)- M 58 Bc-NPM CN biochain 66-(63)- M 61 Bc-NPM CN biochain 67-(62)- M 66 Bc- NPM CN biochain 68-(64)- F 68 Bc-NPM CN 
biochain 69-(41)- M 78 Bc-NPM CN biochain 70-(67)- F&M M (26-78) Bc-NPM & F (53-77).

TABLE 5_1
Key | Full Name
CC | Colon Cancer
CB | Colon Benign
CN | Colon Normal
WT | Weight
HT | Height
Aden | Adenocarcinoma
AI | Adenocarcinoma intramucosal
UA | Ulcerated adenocarcinoma
WP Aden | Well polypoid adenocarcinoma
MA | Mucinous adenocarcinoma
TA | Tubular adenocarcinoma
Carc | Carcinoma
TS Aden | TUBULOVILLOUS ADENOMA
TS Aden HGD | TUBULOVILLOUS ADENOMA with HIGH GRADE DYSPLASIA
NC | Normal Colon
N-PM | Normal PM
N-PM P10 | Normal PM (Pool 10)
Diag | Diagnosis
Div | DIVERTICULITIS
Divs w/ F DIV | Diverticulosis with Focal DIVERTICULITIS
Chr Divs | Chronic diverticulosis
Divs w/ DIV . . . | DIVERTICULOSIS WITH DIVERTICULITIS AND FOCAL ABSCESS FORMATION; NO MALIGNANCY
AD w/ AF | Acute diverticulosis with abscess formation
CU | CECAL ULCERATION
Divs, PA | DIVERTICULOSIS AND PERICOLIC ABSCESS
Tub Aden | TUBULAR ADENOMA
Div/Chr Infl | DIVERTICULOSIS/CHRONIC INFLAMMATION
Chr Div | CHRONIC DIVERTICULITIS
Ext Divs | EXTENSIVE DIVERTICULOSIS
Divs, Chr Div . . . | DIVERTICULOSIS AND CHRONIC DIVERTICULITIS, SEROSAL FIBROSIS AND CHRONIC SEROSITIS
MU w/ MI | Mucosal ulceration with mural inflammation
UC | Ulcerative colitis
Cec | cecum
Dis C | DISTAL COLON
Ret, Low Ant | RETROSIGMOID, LOW ANTERIOR
Rect Col | Rectosigmoidal colon
Sig col | Sigmoid colon
Col Sig | Colon Sigma
RT Col | RIGHT COLON
Prox T Col | PROXIMAL TRANSVERSE COLON
LT Col | Left Colon
Gr | Grade
CS | Cancer Stage
Ethnic B | Ethnic background
NU | Never Used
Occ | Occasion
Cur U | Current use
Dr. per day | Drinks per day
Alc. Dur. | Alcohol Duration
Auto. | Autopsy
Surg. | Surgical
Exc. Y. | Excision Year

Materials and Experimental Procedures Used to Obtain Expression Data

RNA Preparation:

RNA was obtained from ABS (Wilmington, Del. 19801, USA, http://www.absbioreagents.com), BioChain Inst. Inc. (Hayward, Calif. 94545 USA www.biochain.com), GOG for ovary samples (Pediatric Cooperative Human Tissue Network, Gynecologic Oncology Group Tissue Bank, Children's Hospital of Columbus, Columbus Ohio 43205 USA), Clontech (Franklin Lakes, N.J. USA 07417, www.clontech.com), Ambion (Austin, Tex. 78744 USA, http://www.ambion.com), Asterand (Detroit, Mich. 48202-3420, USA, www.asterand.com), and Genomics Collaborative Inc., a Division of Seracare (Cambridge, Mass. 02139, USA, www.genomicsinc.com). Alternatively, RNA was generated from blood cells, cell lines or tissue samples using TRI-Reagent (Molecular Research Center), according to the manufacturer's instructions. Tissue and RNA samples were obtained from patients or postmortem. Most total RNA samples were treated with DNaseI (Ambion).

RT PCR: Purified RNA (2-10 μg) was mixed with 300-1500 ng Random Hexamer primers (Invitrogen) and 500 μM dNTP in a total volume of 31.2 to 156 μl. The mixture was incubated for 5 min at 65° C. and then quickly chilled on ice. Thereafter, 10-50 μl of 5× SuperscriptII first strand buffer (Invitrogen), 4.8 to 24 μl 0.1M DTT and 80-400 units RNasin (Promega) were added, and the mixture was incubated for 10 min at 25° C., followed by further incubation at 42° C. for 2 min. Then, 2-10 μl (400-2000 units) of SuperscriptII (Invitrogen) was added and the reaction (final volume of 50-250 μl) was incubated for 50 min at 42° C. and then inactivated at 70° C. for 15 min. The resulting cDNA was diluted 1:20 in TE buffer (10 mM Tris pH=8, 1 mM EDTA pH=8).

Real-Time RT-PCR analysis was carried out as follows: cDNA (5 μl), prepared as described above, was used as a template in Real-Time PCR reactions (final volume of 20 μl) using the SYBR Green I assay (PE Applied Biosystem) with specific primers and UNG Enzyme (Eurogentech or ABI or Roche). The amplification was effected as follows: 50° C. for 2 min, 95° C. for 10 min, and then 40 cycles of 95° C. for 15 sec, followed by 60° C. for 1 min, followed by a dissociation step. Detection was performed by using the PE Applied Biosystem SDS 7000. The cycle in which the reactions achieved a threshold level of fluorescence (Ct=Threshold Cycle, described in detail below) was registered and was used to calculate the relative transcript quantity in the RT reactions. The relative quantity was calculated using the equation Q=efficiency^−Ct. The efficiency of the PCR reaction was calculated from a standard curve, created by using different dilutions of several reverse transcription (RT) reactions. To minimize inherent differences in the RT reaction, the resulting relative quantities were normalized using a normalization factor calculated in the following way:

The expression of several housekeeping (HSKP) genes was checked on every panel. The relative quantity (Q) of each housekeeping gene in each sample, calculated as described above, was divided by the median quantity of this gene in all panel samples to obtain the “relative Q rel to MED”. Then, for each sample, the median of the “relative Q rel to MED” of the selected housekeeping genes was calculated and served as the normalization factor of this sample for further calculations. A schematic summary of quantitative real-time PCR analysis is presented in FIG. 1. As shown, the x-axis shows the cycle number. The Ct (Threshold Cycle) point is the cycle at which the amplification curve crosses the fluorescence threshold that was set in the experiment. This point is a calculated cycle number at which the PCR product signal is above the background level (passive dye ROX) and still in the Geometric/Exponential phase (as shown, once the level of fluorescence crosses the measurement threshold, it has a geometrically increasing phase, during which measurements are most accurate, followed by a linear phase and a plateau phase; for quantitative measurements, the latter two phases do not provide accurate measurements). The y-axis shows the normalized reporter fluorescence. It should be noted that this type of analysis provides relative quantification.
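The text states that the PCR efficiency was derived from a standard curve built from dilutions of RT reactions, but it does not give the formula. The sketch below assumes the conventional approach: fit Ct against log10 of the relative input and take efficiency = 10^(−1/slope), so that a perfect doubling per cycle gives 2. The function names `pcr_efficiency` and `relative_quantity` are illustrative only and do not come from the source.

```python
import math


def pcr_efficiency(relative_inputs, ct_values):
    """Estimate amplification efficiency from a dilution-series standard curve.

    Assumption (not stated in the source): Ct is fitted against
    log10(relative input) by ordinary least squares, and the efficiency
    is taken as 10 ** (-1 / slope).
    """
    if len(relative_inputs) != len(ct_values) or len(ct_values) < 2:
        raise ValueError("need at least two matched (input, Ct) points")
    xs = [math.log10(v) for v in relative_inputs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx  # negative: more template means a lower Ct
    return 10 ** (-1.0 / slope)


def relative_quantity(efficiency, ct):
    """Relative transcript quantity, Q = efficiency ** (-Ct), as in the text."""
    return efficiency ** (-ct)
```

With an efficiency of 2, a sample whose Ct is three cycles lower than another has a relative quantity eight times higher.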

For each RT sample, the expression of the specific amplicon was normalized to the normalization factor calculated from the expression of different housekeeping genes as described in the section above.
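The normalization procedure described above can be sketched as follows. This is a minimal illustration, assuming that "normalized to the normalization factor" means dividing the amplicon's relative quantity by the per-sample factor; the function names and the dictionary layout are hypothetical and not taken from the source.

```python
from statistics import median


def normalization_factors(hk_quantities):
    """Compute one normalization factor per sample.

    hk_quantities maps housekeeping gene -> {sample: relative quantity Q}.
    Each Q is divided by the median Q of that gene over all panel samples
    ("relative Q rel to MED"); the factor of a sample is the median of
    these ratios over the housekeeping genes.
    """
    rel_to_med = {}
    for gene, per_sample in hk_quantities.items():
        gene_median = median(per_sample.values())
        rel_to_med[gene] = {s: q / gene_median for s, q in per_sample.items()}
    samples = next(iter(hk_quantities.values())).keys()
    return {s: median(rel_to_med[g][s] for g in rel_to_med) for s in samples}


def normalize(target_quantities, factors):
    """Divide each sample's amplicon quantity by its factor (assumed operation)."""
    return {s: q / factors[s] for s, q in target_quantities.items()}
```

A sample whose housekeeping genes all read at half the panel median gets a factor of 0.5, so its amplicon quantity is doubled after normalization.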

These housekeeping genes are different for each panel.

For colon panel: HPRT1 (GenBank Accession No. NM_000194 (SEQ ID NO: 118); amplicon: HPRT1-amplicon (SEQ ID NO: 181)), PBGD (GenBank Accession No. BC019323 (SEQ ID NO: 117); amplicon: PBGD-amplicon (SEQ ID NO: 178)), and G6PD (GenBank Accession No. NM_000402 (SEQ ID NO: 119); amplicon: G6PD amplicon (SEQ ID NO: 184)).

For lung panel: HPRT1 (GenBank Accession No. NM_000194 (SEQ ID NO: 118); amplicon: HPRT1-amplicon (SEQ ID NO: 181)), PBGD (GenBank Accession No. BC019323 (SEQ ID NO: 117); amplicon: PBGD-amplicon (SEQ ID NO: 178)), SDHA (GenBank Accession No. NM_004168 (SEQ ID NO: 116); amplicon: SDHA-amplicon (SEQ ID NO: 175)) and Ubiquitin (GenBank Accession No. BC000449 (SEQ ID NO: 115); amplicon: Ubiquitin-amplicon (SEQ ID NO: 172)).

For ovary panel: SDHA (GenBank Accession No. NM_004168 (SEQ ID NO: 116); amplicon: SDHA-amplicon (SEQ ID NO: 175)), HPRT1 (GenBank Accession No. NM_000194 (SEQ ID NO: 118); amplicon: HPRT1-amplicon (SEQ ID NO: 181)) and G6PD (GenBank Accession No. NM_000402 (SEQ ID NO: 119); amplicon: G6PD amplicon (SEQ ID NO: 184)).

For normal panel: SDHA (GenBank Accession No. NM_004168 (SEQ ID NO: 116); amplicon: SDHA-amplicon (SEQ ID NO: 175)), Ubiquitin (GenBank Accession No. BC000449 (SEQ ID NO: 115); amplicon: Ubiquitin-amplicon (SEQ ID NO: 172)), and TATA box (GenBank Accession No. NM_003194 (SEQ ID NO: 114); amplicon: TATA amplicon (SEQ ID NO: 169)).

For blood panel: HSB1L_HUMAN (Accession No. Q9Y450) (SEQ ID NO: 109), DHSA_HUMAN (Accession No. P31040) (SEQ ID NO: 110), SFRS4_HUMAN (Accession No. Q08170) (SEQ ID NO: 111) and SLC25A3 (Accession No. Q7Z7N7) (SEQ ID NO: 112).

The sequences of the housekeeping genes measured in all the examples of the blood panel were as follows:

HSB1L_HUMAN (SEQ ID NO: 109) (Accession No. Q9Y450)
T05337_seg30-34F1 Forward primer (SEQ ID NO: 152): GCTCCAGGCCATAAGGACTTC
T05337_seg30-34R1 Reverse primer (SEQ ID NO: 153): CAGCTTCAAACTCTCCCCTGC
Amplicon (SEQ ID NO: 154):
GCTCCAGGCCATAAGGACTTCATTCCAAATATGATTACAGGAGCAGCCCA
GGCGGATGTAGCTGTTTTAGTTGTAGATGCCAGCAGGGGAGAGTTTGAAG
CTG

DHSA_HUMAN (SEQ ID NO: 110) (Accession No P31040)
M78124_seg45-48F1 Forward primer (SEQ ID NO: 155): TTCCTTGCCAGGACCTAGAG
M78124_seg45-48R1 Reverse primer (SEQ ID NO: 156): CATAAACCTTTCGCCTTGAC
Amplicon (SEQ ID NO: 157):
TTCCTTGCCAGGACCTAGAGTTTGTTCAGTTCCACCCCACAGGCATATAT
GGTGCTGGTTGTCTCATTACGGAAGGATGTCGTGGAGAGGGAGGCATTCT
CATTAACAGTCAAGGCGAAAGGTTTATG

SFRS4_HUMAN (SEQ ID NO: 111) (Accession No Q08170)
HUMSRP75Aseg30-33F1 Forward primer (SEQ ID NO: 158): AATTTGTCAAGTCGGTGCAGC
HUMSRP75Aseg30-33R1 Reverse primer (SEQ ID NO: 159): TCACCCCTTCATTTTTGCGT
Amplicon (SEQ ID NO: 160):
AATTTGTCAAGTCGGTGCAGCTGGCAAGACCTAAAGGATTATATGCGTCA
GGCAGGAGAAGTGACTTATGCAGATGCTCACAAGGGACGCAAAAATGAAG
GGGTGA

SLC25A3 (Accession No Q7Z7N7) (SEQ ID NO: 112)
SSMPCPseg24-25-29F1 Forward primer (SEQ ID NO: 161): CCCAAAATGTATAAGGAAGAAGGC
SSMPCPseg24-25-29R1 Reverse primer (SEQ ID NO: 162): TTCAAAGCAGGCGAACTTCA
Amplicon (SEQ ID NO: 163):
CAGCCAGGTTATGCCAACACTTTGAGGGATGCAGCTCCCAAAATGTATAA
GGAAGAAGGCCTAAAAGCATTCTACAAGGGGGTTGCTCCTCTCTGGATGA
GACAGATACCATACACCATGATGAAGTTCGCCTGCTTTGA
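A listing of this kind can be sanity-checked by confirming that an amplicon begins with its forward primer and ends with the reverse complement of its reverse primer. The sketch below is illustrative only; the helper names are hypothetical, and the check is applied here to the HSB1L_HUMAN entries exactly as printed above (whitespace in the printed amplicon is ignored).

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def primers_flank_amplicon(forward, reverse, amplicon):
    """True if the amplicon starts with the forward primer and ends with
    the reverse complement of the reverse primer."""
    amplicon = "".join(amplicon.split()).upper()  # drop line-wrap spaces
    return (amplicon.startswith(forward.upper())
            and amplicon.endswith(reverse_complement(reverse.upper())))


# HSB1L_HUMAN primers and amplicon (SEQ ID NOs: 152-154) as listed above.
HSB1L_FORWARD = "GCTCCAGGCCATAAGGACTTC"
HSB1L_REVERSE = "CAGCTTCAAACTCTCCCCTGC"
HSB1L_AMPLICON = (
    "GCTCCAGGCCATAAGGACTTCATTCCAAATATGATTACAGGAGCAGCCCA "
    "GGCGGATGTAGCTGTTTTAGTTGTAGATGCCAGCAGGGGAGAGTTTGAAG "
    "CTG"
)

hsb1l_ok = primers_flank_amplicon(HSB1L_FORWARD, HSB1L_REVERSE, HSB1L_AMPLICON)
```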

The sequences of the housekeeping genes measured in all the examples on the normal tissue samples panel were as follows:

TATA box (GenBank Accession No. NM_003194 (SEQ ID NO: 114))
TATA box Forward primer (SEQ ID NO: 167): CGGTTTGCTGCGGTAATCAT
TATA box Reverse primer (SEQ ID NO: 168): TTTCTTGCTGCCAGTCTGGAC
TATA box-amplicon (SEQ ID NO: 169):
CGGTTTGCTGCGGTAATCATGAGGATAAGAGAGCCACGAACCACGGCACT
GATTTTCAGTTCTGGGAAAATGGTGTGCACAGGAGCCAAGAGTGAAGAAC
AGTCCAGACTGGCAGCAAGAAA

Ubiquitin (GenBank Accession No. BC000449 (SEQ ID NO: 115))
Ubiquitin Forward primer (SEQ ID NO: 170): ATTTGGGTCGCGGTTCTTG
Ubiquitin Reverse primer (SEQ ID NO: 171): TGCCTTGACATTCTCGATGGT
Ubiquitin-amplicon (SEQ ID NO: 172):
ATTTGGGTCGCGGTTCTTGTTTGTGGATCGCTGTGATCGTCACTTGACA
ATGCAGATCTTCGTGAAGACTCTGACTGGTAAGACCATCACCCTCGAGG
TTGAGCCCAGTGACACCATCGAGAATGTCAAGGCA

SDHA (GenBank Accession No. NM_004168 (SEQ ID NO: 116))
SDHA Forward primer (SEQ ID NO: 173): TGGGAACAAGAGGGCATCTG
SDHA Reverse primer (SEQ ID NO: 174): CCACCACTGCATCAAATTCATG
SDHA-amplicon (SEQ ID NO: 175):
TGGGAACAAGAGGGCATCTGCTAAAGTTTCAGATTCCATTTCTGCTCAGT
ATCCAGTAGTGGATCATGAATTTGATGCAGTGGTGG

The sequences for primers and amplicons of the housekeeping genes measured in all the cancer examples are listed below. For colon panel—HPRT1, PBGD and G6PD were used. For lung panel—PBGD, HPRT1, Ubiquitin and SDHA were used. For ovary panel—HPRT1, SDHA and G6PD were used.

SDHA (GenBank Accession No. NM_004168 (SEQ ID NO: 116)):

SDHA Forward primer (SEQ ID NO: 173): TGGGAACAAGAGGGCATCTG
SDHA Reverse primer (SEQ ID NO: 174): CCACCACTGCATCAAATTCATG
SDHA-amplicon (SEQ ID NO: 175):
TGGGAACAAGAGGGCATCTGCTAAAGTTTCAGATTCCATTTCTGCTCAGT
ATCCAGTAGTGGATCATGAATTTGATGCAGTGGTGG

PBGD (GenBank Accession No. BC019323 (SEQ ID NO: 117))
PBGD Forward primer (SEQ ID NO: 176): TGAGAGTGATTCGCGTGGG
PBGD Reverse primer (SEQ ID NO: 177): CCAGGGTACGAGGCTTTCAAT
PBGD-amplicon (SEQ ID NO: 178):
TGAGAGTGATTCGCGTGGGTACCCGCAAGAGCCAGCTTGCTCGCATACAG
ACGGACAGTGTGGTGGCAACATTGAAAGCCTCGTACCCTGG

HPRT1 (GenBank Accession No. NM_000194 (SEQ ID NO: 118))
HPRT1 Forward primer (SEQ ID NO: 179): TGACACTGGCAAAACAATGCA
HPRT1 Reverse primer (SEQ ID NO: 180): GGTCCTTTTCACCAGCAAGCT
HPRT1-amplicon (SEQ ID NO: 181):
TGACACTGGCAAAACAATGCAGACTTTGCTTTCCTTGGTCAGGCAGTATA
ATCCAAAGATGGTCAAGGTCGCAAGCTTGCTGGTGAAAAGGACC

G6PD (GenBank Accession No. NM_000402 (SEQ ID NO: 119))
G6PD Forward primer (SEQ ID NO: 182): gaggccgtcaccaagaacat
G6PD Reverse primer (SEQ ID NO: 183): ggacagccggtcagagctc
G6PD-amplicon (SEQ ID NO: 184):
gaggccgtcaccaagaacattcacgagtcctgcatgagccagataggctg
gaaccgcatcatcgtggagaagcccttcgggagggacctgcagagctctg
accggctgtcc

Ubiquitin (GenBank Accession No. BC000449 (SEQ ID NO: 115))
Ubiquitin Forward primer (SEQ ID NO: 170): ATTTGGGTCGCGGTTCTTG
Ubiquitin Reverse primer (SEQ ID NO: 171): TGCCTTGACATTCTCGATGGT
Ubiquitin Amplicon (SEQ ID NO: 172):
ATTTGGGTCGCGGTTCTTGTTTGTGGATCGCTGTGATCGTCACTTGACAA
TGCAGATCTTCGTGAAGACTCTGACTGGTAAGACCATCACCCTCGAGG
TTGAGCCCAGTGACACCATCGAGAATGTCAAGGCA

Another methodology used to predict the expression pattern of the proteins of the invention was the MED discovery engine:

MED is a platform for collection of public gene-expression data, normalization, annotation and performance of various queries. Expression data from the most widely used Affymetrix microarrays is downloaded from the Gene Expression Omnibus (GEO, www.ncbi.nlm.nih.gov/GEO). Data is multiplicatively normalized by setting the 95th percentile to a constant value (normalized expression=1200), and noise is filtered by setting the lower 30% to 0. Experiments are annotated, first automatically and then manually, to identify tissue and condition; chips are grouped according to this annotation, and the grouping is cross-verified by comparing the overall expression pattern of the genes of each chip to the overall average expression pattern of the genes in the group. Each probeset in each group is assigned an expression value which is the median of the expressions of that probeset in all chips included in the group. The vector of expression of all probesets within a certain group is the virtual chip of that group, and the collection of all such virtual chips is a virtual panel. The panel (or sub-panels) can be queried to identify probesets with a required behavior (e.g. specific expression in a sub-set of tissues, or differential expression between disease and healthy tissues). These probesets are linked to LEADS contigs and to RefSeqs (http://www.ncbi.nlm.nih.gov/RefSeq/) by probe-level mapping, for further analysis.
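The normalization and virtual-chip steps described above can be sketched as follows. This is an illustration under stated assumptions, not the MED implementation: the percentile is computed by the nearest-rank method, "the lower 30%" is read as the lowest 30% of values on each chip after scaling, and all function names are hypothetical.

```python
import math
from statistics import median


def percentile(values, pct):
    """Nearest-rank percentile (one of several common definitions)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def normalize_chip(chip, target=1200.0, noise_pct=30):
    """Scale a chip so its 95th percentile equals `target`, then zero the
    values at or below the `noise_pct` percentile (assumed reading of
    'setting the lower 30% to 0').

    chip maps probeset ID -> raw expression value.
    """
    scale = target / percentile(chip.values(), 95)
    scaled = {p: v * scale for p, v in chip.items()}
    cutoff = percentile(scaled.values(), noise_pct)
    return {p: (0.0 if v <= cutoff else v) for p, v in scaled.items()}


def virtual_chip(chips):
    """Median expression per probeset over all chips of one group,
    restricted to probesets present on every chip."""
    shared = set.intersection(*(set(c) for c in chips))
    return {p: median(c[p] for c in chips) for p in shared}
```

A virtual panel would then be a mapping from group name to the `virtual_chip` of that group's normalized chips.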

The Affymetrix platforms that are downloaded are HG-U95A and the HG-U133 family (A, B, A2.0 and PLUS 2.0). Then three virtual panels were created: U95 and U133 Plus 2.0, based on the corresponding platforms, and U133 which uses the set of common probesets for HG-U133A, HG-U133A2.0 and HG-U133 PLUS 2.0+.

The results of the MED discovery engine are presented in scatter plots. The scatter plot is a compact representation of a given panel (collection of groups). The y-axis is the (normalized) expression and the x-axis describes the groups in the panel. For each group, the median expression is represented by a solid marker, and the expression values of the different chips in the group are represented by small dashes (“-”). The groups are ordered and marked as follows: “Other” groups (e.g. benign, non-cancer diseases, etc.) with a triangle, Treated cells with a square, Normal with a circle, Matched with a cross, and Cancer with a diamond. The number of chips in each group is also written adjacent to its name.

Example 2 Description for Cluster AI581519

The present invention relates to VSIG1 polypeptides, novel splice variants and diagnostics and therapeutics based thereon.

According to the present invention, Cluster AI581519 (internal ID 72756422) features 10 transcripts and 2 segments of interest, the names for which are given in Tables 6 and 7, respectively. The selected protein variants are given in Table 8.

TABLE 6 Transcripts of interest
Transcript Name
AI581519_T0 (SEQ ID NO: 1)
AI581519_T1 (SEQ ID NO: 2)
AI581519_T2 (SEQ ID NO: 3)
AI581519_T3 (SEQ ID NO: 4)
AI581519_T4 (SEQ ID NO: 5)
AI581519_T5 (SEQ ID NO: 6)
AI581519_T6 (SEQ ID NO: 7)
AI581519_T8 (SEQ ID NO: 8)
AI581519_T10 (SEQ ID NO: 9)
AI581519_T11 (SEQ ID NO: 10)

TABLE 7 Segments of interest
Segment Name
AI581519_N7 (SEQ ID NO: 120)
AI581519_N9 (SEQ ID NO: 121)

TABLE 8 Proteins of interest
Protein Name                    Corresponding Transcripts
AI581519_P3 (SEQ ID NO: 11)     AI581519_T0 (SEQ ID NO: 1); AI581519_T1 (SEQ ID NO: 2); AI581519_T2 (SEQ ID NO: 3); AI581519_T3 (SEQ ID NO: 4); AI581519_T4 (SEQ ID NO: 5)
AI581519_P4 (SEQ ID NO: 12)     AI581519_T5 (SEQ ID NO: 6)
AI581519_P5 (SEQ ID NO: 13)     AI581519_T6 (SEQ ID NO: 7)
AI581519_P7 (SEQ ID NO: 14)     AI581519_T8 (SEQ ID NO: 8)
AI581519_P9 (SEQ ID NO: 15)     AI581519_T10 (SEQ ID NO: 9)
AI581519_P10 (SEQ ID NO: 16)    AI581519_T11 (SEQ ID NO: 10)

These sequences are variants of the known protein V-set and immunoglobulin domain containing 1 (RefSeq accession identifier NP_872413, synonyms: RP5-889N15.1, 1700062D20Rik, GPA34, MGC44287, dJ889N15.1), referred to herein as the previously known protein.

VSIG1 is a V-set and immunoglobulin domain containing 1 protein, also known as glycoprotein A34 (GPA34). This gene was originally identified as a transcript encoding a protein with similarity to the glycoprotein A33 (GPA33), a colon cancer antigen, to which it has 32% identity (Scanlan et al. 2006, Cancer Immunity 6: 2). The authors showed that A34 mRNA and protein expression is highly tissue-restricted, as it is expressed predominantly in stomach and testis. A34 mRNA and protein expression was also detected in gastric cancers, esophageal carcinomas, and ovarian cancers. In their studies they did not detect A34 in lung, breast or colon carcinomas (Scanlan et al. 2006, Cancer Immunity 6: 2).

A known wild type VSIG1 nucleic acid sequence has been reported in various patent and non-patent literature references. For example, the sequence of AI581519_P3 (SEQ ID NO:11) is disclosed in WO2004037999, referred to there as glycoprotein A34 (GPA34). This PCT application contains the sequence for AI581519_P3 (SEQ ID NO:11) which encodes the A34 antigen disclosed herein. The corresponding antigen A34 is indicated to be expressed in some tested stomach cancers (29%) and esophageal cancers (63%), and to be expressed to a much lesser extent (9%) on tested ovarian cancers. The authors suggest that this antigen may be used for therapy and may be a suitable target for antibody based cancer therapies.

WO9926972 discloses that this antigenic protein may also exhibit immune stimulating or immune suppressing activity, such as for the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as affecting the cytolytic activity of NK cells and other cell populations.

In addition this same protein sequence, as depicted in AI581519_P3 (SEQ ID NO:11) herein, is mentioned in WO9960020 and US2002193567 wherein it is identified as Human secreted protein #62.

A sequence homologous to the VSIG1 variant depicted in AI581519_P4 (SEQ ID NO:12) herein (containing 2 mismatches corresponding to known SNPs) is disclosed in application WO2003027228, which purportedly discloses an extensive list of different alleged differentially expressed sequences.

Further, a sequence closely related to AI581519_P5 (SEQ ID NO:13) (containing 2 mismatches corresponding to known SNPs) is disclosed in PCT application WO2004100774, which also teaches many other purported differentially expressed sequences.

Still further, a sequence closely related to AI581519_P7 (SEQ ID NO:14) (containing 1 mismatch corresponding to a known SNP) is disclosed in PCT application WO2004048550, which similarly contains an extensive listing of alleged differentially expressed sequences.

According to the present invention, VSIG1 is predicted to be a novel B7 member, based on the presence of an IgV and an IgC2 domain. A large portion of proteins having one domain of each Ig subtype are co-stimulatory molecules. Like other known B7 members, VSIG1 is also a type I membrane protein. In the present invention several alternatively spliced variants of VSIG1 were identified, as described below, containing a unique region within the ectodomain. The new variants of VSIG1 were demonstrated in the present invention to be overexpressed in lung adenocarcinoma and ovarian cancer.

The MED discovery engine described in Example 1 herein was used to assess the expression of VSIG1 transcripts. Expression data for Affymetrix probe set 234370_at, representing the VSIG1 gene, are shown in FIG. 2. As evident from the scatter plot presented in FIG. 2, the expression of VSIG1 transcripts detectable with the above probe set was higher in lung cancer compared to normal lung samples.

As noted above, cluster AI581519 features 10 transcripts, which were listed in Table 6 above. These transcripts encode for proteins which are variants of protein V-set and immunoglobulin domain containing 1. A description of each variant protein according to the present invention is now provided.

Variant protein AI581519_P3 (SEQ ID NO:11) according to the present invention has an amino acid sequence encoded by transcripts AI581519_T0 (SEQ ID NO:1), AI581519_T1 (SEQ ID NO:2), AI581519_T2 (SEQ ID NO:3), AI581519_T3 (SEQ ID NO:4) and AI581519_T4 (SEQ ID NO:5).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P3 (SEQ ID NO:11) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 9, (given according to their positions on the amino acid sequence, with the alternative amino acids listed).

TABLE 9 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
90                                     I -> V
127                                    Q ->
146                                    S -> R
176                                    K ->
176                                    K -> E
181                                    D -> G
181                                    D -> V
189                                    F -> L
195                                    I -> T
196                                    L ->
202                                    T ->
280                                    T ->
288                                    S -> G
322                                    P ->
344                                    L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 10:

TABLE 10 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin C2 type    HMMSmart         34-123, 152-218
Immunoglobulin-like       ProfileScan      36-120, 140-227
Immunoglobulin-like       HMMPfam          36-118, 154-213
Immunoglobulin subtype    HMMSmart         28-137, 146-229
Myelin P0 protein         FPrintScan       35-59, 110-139
Immunoglobulin V-set      HMMPfam          21-137
Immunoglobulin V-type     HMMSmart         38-118

Variant protein AI581519_P3 (SEQ ID NO:11) is encoded by the following transcripts: AI581519_T0 (SEQ ID NO:1), AI581519_T1 (SEQ ID NO:2), AI581519_T2 (SEQ ID NO:3), AI581519_T3 (SEQ ID NO:4) and AI581519_T4 (SEQ ID NO:5).

The coding portion of transcript AI581519_T0 (SEQ ID NO:1) starts at position 171 and ends at position 1331. The transcript also has the following SNPs as listed in Table 11 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed (SEQ ID NO:1)).

TABLE 11 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1031, 1032, 2314, 2471
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 1010, 1136
A -> T          712
T -> C          735, 754, 1201
G -> A          1679, 1800, 1867
T -> A          2260
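The relationship between the nucleotide SNP positions of Table 11 and the amino acid positions of Table 9 can be illustrated by the following sketch. It assumes that positions are 1-based and inclusive, and that the stated coding portion (171 to 1331) contains only sense codons, which is consistent with the 387 residues of the known protein. The function names are hypothetical and are not part of the disclosed method.

```python
CDS_START = 171  # coding start of AI581519_T0, from the text
CDS_END = 1331   # coding end of AI581519_T0, from the text


def protein_length(cds_start, cds_end):
    """Number of codons in a 1-based, inclusive coding span."""
    span = cds_end - cds_start + 1
    if span % 3 != 0:
        raise ValueError("coding span is not a whole number of codons")
    return span // 3


def codon_number(nt_pos, cds_start=CDS_START, cds_end=CDS_END):
    """1-based amino acid position for a nucleotide, or None outside the CDS."""
    if not cds_start <= nt_pos <= cds_end:
        return None
    return (nt_pos - cds_start) // 3 + 1


if __name__ == "__main__":
    print(protein_length(CDS_START, CDS_END))  # 387
    for nt in (238, 275, 322, 438, 1201, 1679):
        print(nt, codon_number(nt))
```

Under these assumptions nucleotide 238 falls in codon 23 and nucleotide 1201 in codon 344, matching the first and last rows of Table 9, while position 1679 lies outside the coding portion.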

The coding portion of transcript AI581519_T1 (SEQ ID NO:2) starts at position 171 and ends at position 1331. The transcript also has the following SNPs as listed in Table 12 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 12 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1031, 1032, 2314, 2471
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 1010, 1136
A -> T          712
T -> C          735, 754, 1201
G -> A          1679, 1800, 1867
T -> A          2260

The coding portion of transcript AI581519_T2 (SEQ ID NO:3) starts at position 171 and ends at position 1331. The transcript also has the following SNPs as listed in Table 13 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 13 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1031, 1032, 1782
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 1010, 1136
A -> T          712
T -> C          735, 754, 1201
G -> A          1679

The coding portion of transcript AI581519_T3 (SEQ ID NO:4) starts at position 171 and ends at position 1331. The transcript also has the following SNPs as listed in Table 14 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 14 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1031, 1032, 1702
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 1010, 1136
A -> T          712
T -> C          735, 754, 1201
G -> A          1679

The coding portion of transcript AI581519_T4 (SEQ ID NO:5) starts at position 171 and ends at position 1331. The transcript also has the following SNPs as listed in Table 15 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 15 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1031, 1032
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 1010, 1136
A -> T          712
T -> C          735, 754, 1201

Variant protein AI581519_P4 (SEQ ID NO:12) according to the present invention has an amino acid sequence encoded by transcript AI581519_T5 (SEQ ID NO:6). Alignments to previously published protein sequences are shown in FIG. 3A. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

2. Comparison report between AI581519_P4 (SEQ ID NO:12) and known proteins NP_872413 (SEQ ID NO: 11) and Q86XK7_HUMAN (FIG. 3A):

A. An isolated chimeric polypeptide encoding for AI581519_P4 (SEQ ID NO:12), comprising a first amino acid sequence being at least 90% homologous to

MVFAFWKVFLILSCLAGQVSVVQVTIPDGFVNVTVGSNVTLICIYTTTVA SREQLSIQWSFFHKKEMEPIS corresponding to amino acids 1-71 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 1-71 of AI581519_P4 (SEQ ID NO:12), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence HSSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSE (SEQ ID NO: 284) corresponding to amino acids 72-107 of AI581519_P4 (SEQ ID NO:12), and a third amino acid sequence being at least 90% homologous to

IYFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNP PDFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPV YYWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSS CEIDLTSSHPEVGIIVGALIGSLVGAAIIISVVCFARNKAKAKAKERNSK TIAELEPMTKINPRGESEAMPREDATQLEVTLPSSIHETGPDTIQEPDYE PKPTQEPAPEPAPGSEPMAVPDLDIELELEPETQSELEPEPEPEPESEPG VVVEPLSEDEKGVVKA corresponding to amino acids 72-387 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 108-423 of AI581519_P4 (SEQ ID NO:12), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

C. An isolated polypeptide encoding for an edge portion of AI581519_P4 (SEQ ID NO:12), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence HSSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSE (SEQ ID NO: 284) of AI581519_P4 (SEQ ID NO:12).
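The coordinate bookkeeping in the comparison report above can be checked with the following sketch, which lays three contiguous segments end to end and reports the position of each on the variant and, where applicable, on the known protein. The segment lengths (71, 36 and 316 residues) follow from the amino acid ranges stated in the text; the `Segment` type and the `layout` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    label: str
    length: int                 # number of residues
    known_start: Optional[int]  # 1-based start on the known protein, None if unique


def layout(segments):
    """Return (label, variant_range, known_range) for contiguous segments."""
    rows, pos = [], 1
    for seg in segments:
        end = pos + seg.length - 1
        known = None
        if seg.known_start is not None:
            known = (seg.known_start, seg.known_start + seg.length - 1)
        rows.append((seg.label, (pos, end), known))
        pos = end + 1
    return rows


# Segment lengths derived from the ranges given for AI581519_P4.
P4 = [
    Segment("first", 71, 1),
    Segment("second (SEQ ID NO: 284)", 36, None),
    Segment("third", 316, 72),
]

if __name__ == "__main__":
    for row in layout(P4):
        print(row)
```

The resulting total of 423 residues agrees with the coding portion reported for AI581519_T5 (positions 171 to 1439, i.e. 1269 nucleotides or 423 codons).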

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P4 (SEQ ID NO:12) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 16, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:12)).

TABLE 16 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
87                                     G -> S
95                                     V -> A
126                                    I -> V
163                                    Q ->
182                                    S -> R
212                                    K ->
212                                    K -> E
217                                    D -> G
217                                    D -> V
225                                    F -> L
231                                    I -> T
232                                    L ->
238                                    T ->
316                                    T ->
324                                    S -> G
358                                    P ->
380                                    L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 17:

TABLE 17 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin C2 type    HMMSmart         34-159, 188-254
Immunoglobulin subtype    HMMSmart         28-173, 182-265
Immunoglobulin-like       ProfileScan      36-156, 176-263
Immunoglobulin V-set      HMMPfam          21-173
Immunoglobulin V-type     HMMSmart         38-154

Variant protein AI581519_P4 (SEQ ID NO:12) is encoded by the AI581519_T5 (SEQ ID NO:6), for which the coding portion starts at position 171 and ends at position 1439. The transcript also has the following SNPs as listed in Table 18 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 18 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 546, 804, 820, 1139, 1140, 2422, 2579
T ->            275, 865
C -> T          322, 560
G -> A          429, 1787, 1908, 1975
T -> C          454, 843, 862, 1309
C -> A          557, 716
C ->            657
C -> G          716
A ->            804, 884, 1118, 1244
A -> T          820
T -> A          2368

Variant protein AI581519_P5 (SEQ ID NO:13) according to the present invention has an amino acid sequence encoded by transcript AI581519_T6 (SEQ ID NO:7). Alignments to previously published protein sequences are shown in FIG. 3B. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

2. Comparison report between AI581519_P5 (SEQ ID NO:13) and known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11) (FIG. 3B):

A. An isolated chimeric polypeptide encoding for AI581519_P5 (SEQ ID NO:13), comprising a first amino acid sequence being at least 90% homologous to

MVFAFWKVFLILSCLAGQVSVVQVTIPDGFVNVTVGSNVTLICIYTTTVA SREQLSIQWSFFHKKEMEPIS corresponding to amino acids 1-71 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 1-71 of AI581519_P5 (SEQ ID NO:13), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence

(SEQ ID NO: 285) HSSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSESPWEEGKWPDVEAV KGTLDGQQAELQ corresponding to amino acids 72-133 of AI581519_P5 (SEQ ID NO:13), and a third amino acid sequence being at least 90% homologous to

IYFSQGGQAVAIGQFKDRITGSNDPGNASITISHMQPADSGIYICDVNNP PDFLGQNQGILNVSVLVKPSKPLCSVQGRPETGHTISLSCLSALGTPSPV YYWHKLEGRDIVPVKENFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSS CEIDLTSSHPEVGIIVGALIGSLVGAAIIISVVCFARNKAKAKAKERNSK TIAELEPMTKINPRGESEAMPREDATQLEVTLPSSIHETGPDTIQEPDYE PKPTQEPAPEPAPGSEPMAVPDLDIELELEPETQSELEPEPEPEPESEPG VVVEPLSEDEKGVVKA corresponding to amino acids 72-387 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 134-449 of AI581519_P5 (SEQ ID NO:13), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

C. An isolated polypeptide encoding for an edge portion of AI581519_P5 (SEQ ID NO:13), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence

(SEQ ID NO: 285) HSSCLSTEGMEEKAVGQCLKMTHVRDARGRCSWTSESPWEEGKWPDVEAV KGTLDGQQAELQ of AI581519_P5 (SEQ ID NO:13).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P5 (SEQ ID NO:13) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 19, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:13)).

TABLE 19 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
87                                     G -> S
95                                     V -> A
152                                    I -> V
189                                    Q ->
208                                    S -> R
238                                    K ->
238                                    K -> E
243                                    D -> G
243                                    D -> V
251                                    F -> L
257                                    I -> T
258                                    L ->
264                                    T ->
342                                    T ->
350                                    S -> G
384                                    P ->
406                                    L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 20:

TABLE 20 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin subtype    HMMSmart         28-199, 208-291
Immunoglobulin C2 type    HMMSmart         34-185, 214-280
Immunoglobulin V-set      HMMPfam          21-199
Immunoglobulin-like       ProfileScan      202-289

Variant protein AI581519_P5 (SEQ ID NO:13) is encoded by the following transcript AI581519_T6 (SEQ ID NO:7), for which the coding portion starts at position 171 and ends at position 1517. The transcript also has the following SNPs as listed in Table 21 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 21 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 624, 882, 898, 1217, 1218, 2500, 2657
T ->            275, 943
C -> T          322, 638
G -> A          429, 1865, 1986, 2053
T -> C          454, 921, 940, 1387
C -> A          635, 794
C ->            735
C -> G          794
A ->            882, 962, 1196, 1322
A -> T          898
T -> A          2446

Variant protein AI581519_P7 (SEQ ID NO:14) according to the present invention is encoded by transcript AI581519_T8 (SEQ ID NO:8). Alignments to one or more previously published protein sequences are shown in FIG. 3C. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

2. Comparison report between AI581519_P7 (SEQ ID NO:14) and known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11) (FIG. 3C):

A. An isolated chimeric polypeptide encoding for AI581519_P7 (SEQ ID NO:14), comprising a first amino acid sequence being at least 90% homologous to

MVFAFWKVFLILSCLAGQVSVVQVTIPDGFVNVTVGSNVTLICIYTTTVA SREQLSIQWSFFHKKEMEPISIYFSQGGQAVAIGQFKDRITGSNDP corresponding to amino acids 1-96 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 1-96 of AI581519_P7 (SEQ ID NO:14), and a second amino acid sequence being at least 90% homologous to

VKPSKPLCSVQGRPETGHTISLSCLSALGTPSPVYYWHKLEGRDIVPVKE NFNPTTGILVIGNLTNFEQGYYQCTAINRLGNSSCEIDLTSSHPEVGIIV GALIGSLVGAAIIISVVCFARNKAKAKAKERNSKTIAELEPMTKINPRGE SEAMPREDATQLEVTLPSSIHETGPDTIQEPDYEPKPTQEPAPEPAPGSE PMAVPDLDIELELEPETQSELEPEPEPEPESEPGVVVEPLSEDEKGVVKA corresponding to amino acids 138-387 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 97-346 of AI581519_P7 (SEQ ID NO:14), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

C. An isolated chimeric polypeptide encoding for an edge portion of AI581519_P7 (SEQ ID NO:14), comprising a polypeptide having a length “n”, wherein n is at least about 10 amino acids in length, optionally at least about 20 amino acids in length, preferably at least about 30 amino acids in length, more preferably at least about 40 amino acids in length and most preferably at least about 50 amino acids in length, wherein at least two amino acids comprise PV, having a structure as follows: a sequence starting from any of amino acid numbers 96−x to 96; and ending at any of amino acid numbers 97+((n−2)−x), in which x varies from 0 to n−2.
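The family of edge-portion windows defined above can be enumerated as in the following sketch. It reads the definition as a window of length n that starts at amino acid 96−x and ends at amino acid 97+((n−2)−x), for x from 0 to n−2, so that every window spans the junction between amino acids 96 and 97. The function name and the default argument are hypothetical.

```python
JUNCTION_LEFT = 96  # last residue before the junction in AI581519_P7, from the text


def edge_windows(n, left=JUNCTION_LEFT):
    """List (start, end) of every length-n window bridging residues left/left+1."""
    if n < 2:
        raise ValueError("a bridging window needs at least two residues")
    right = left + 1
    # x runs from 0 to n-2 inclusive, as in the text.
    return [(left - x, right + (n - 2) - x) for x in range(n - 1)]


if __name__ == "__main__":
    for start, end in edge_windows(10):
        print(start, end)
```

For n = 10 this gives nine windows, from 96-105 (x = 0) to 88-97 (x = 8), each ten residues long and each containing both amino acids 96 and 97.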

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P7 (SEQ ID NO:14) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 22, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:14)).

TABLE 22 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
90                                     I -> V
105                                    S -> R
135                                    K ->
135                                    K -> E
140                                    D -> G
140                                    D -> V
148                                    F -> L
154                                    I -> T
155                                    L ->
161                                    T ->
239                                    T ->
247                                    S -> G
281                                    P ->
303                                    L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 23:

TABLE 23 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin-like       ProfileScan      99-186
Immunoglobulin C2 type    HMMSmart         111-177
Immunoglobulin subtype    HMMSmart         28-188

Variant protein AI581519_P7 (SEQ ID NO:14) is encoded by the transcript AI581519_T8 (SEQ ID NO:8), for which the coding portion starts at position 171 and ends at position 1208. The transcript also has the following SNPs as listed in Table 24 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 24 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 573, 589, 908, 909, 2191, 2348
T ->            275, 634
C -> T          322, 452
C -> A          449, 485
C -> G          485
A ->            573, 653, 887, 1013
A -> T          589
T -> C          612, 631, 1078
G -> A          1556, 1677, 1744
T -> A          2137

Variant protein AI581519_P9 (SEQ ID NO:15) according to the present invention has an amino acid sequence encoded by transcript AI581519_T10 (SEQ ID NO:9). Alignments to one or more previously published protein sequences are shown in FIG. 3D. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

2. Comparison report between AI581519_P9 (SEQ ID NO:15) and known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11) (FIG. 3D):

A. An isolated chimeric polypeptide encoding for AI581519_P9 (SEQ ID NO:15), comprising a first amino acid sequence being at least 90% homologous to

MVFAFWKVFLILSCLAGQVSVVQVTIPDGFVNVTVGSNVTLICIYTTTVA SREQLSIQWSFFHKKEMEPISIYFSQGGQAVAIGQFKDRITGSNDPGNAS ITISHMQPADSGIYICDVNNPPDFLGQNQGILNVSVLVKPSKPLCSVQGR PETGHTISLSCLSALGTPSPVYYWHKLEGRDIVPVKENF corresponding to amino acids 1-189 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 1-189 of AI581519_P9 (SEQ ID NO:15), and a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence TNHRDFGHWKSDKF (SEQ ID NO: 286) corresponding to amino acids 190-203 of AI581519_P9 (SEQ ID NO:15), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

C. An isolated polypeptide encoding for an edge portion of AI581519_P9 (SEQ ID NO:15), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence TNHRDFGHWKSDKF (SEQ ID NO: 286) of AI581519_P9 (SEQ ID NO:15).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P9 (SEQ ID NO:15) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 25, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:15)).

TABLE 25 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
90                                     I -> V
127                                    Q ->
146                                    S -> R
176                                    K ->
176                                    K -> E
181                                    D -> G
181                                    D -> V
189                                    F -> L
195                                    F ->
202                                    K ->

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 26:

TABLE 26 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin-like       HMMPfam          36-118
Myelin P0 protein         FPrintScan       35-59, 110-139
Immunoglobulin-like       ProfileScan      36-120
Immunoglobulin subtype    HMMSmart         28-137
Immunoglobulin V-set      HMMPfam          21-137
Immunoglobulin V-type     HMMSmart         38-118

Variant protein AI581519_P9 (SEQ ID NO:15) is encoded by the transcript AI581519_T10 (SEQ ID NO:9), for which the coding portion starts at position 171 and ends at position 779. The transcript also has the following SNPs as listed in Table 27 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 27 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 1029, 1030, 2312, 2469
T ->            275, 755
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 774, 1008, 1134
A -> T          712
T -> C          735, 752, 1199
G -> A          1677, 1798, 1865
T -> A          2258

Variant protein AI581519_P10 (SEQ ID NO:16) according to the present invention has an amino acid sequence as encoded by transcript AI581519_T11 (SEQ ID NO:10). Alignments to previously published protein sequences are shown in FIG. 3E. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

2. Comparison report between AI581519_P10 (SEQ ID NO:16) and known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11) (FIG. 3E):

A. An isolated chimeric polypeptide encoding for AI581519_P10 (SEQ ID NO:16), comprising a first amino acid sequence being at least 90% homologous to

MVFAFWKVFLILSCLAGQVSVVQVTIPDGFVNVTVGSNVTLICIYTTTVA SREQLSIQWSFFHKKEMEPISIYFSQGGQAVAIGQFKDRITGSNDPGNAS ITISHMQPADSGIYICDVNNPPDFLGQNQGILNVSVLVKPSKPLCSVQGR PETGHTISLSCLSALGTPSPVYYWHKLEGRDIVPVKENFNPTTGILVIGN LTNFEQGYYQCTAINRLGNSSCEIDLTSS corresponding to amino acids 1-229 of known proteins NP_872413 and Q86XK7_HUMAN (SEQ ID NO: 11), which also corresponds to amino acids 1-229 of AI581519_P10 (SEQ ID NO:16), and a second amino acid sequence RQ (SEQ ID NO: 287) corresponding to amino acids 230-231 of AI581519_P10 (SEQ ID NO:16), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AI581519_P10 (SEQ ID NO:16) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 28, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:16)).

TABLE 28 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acids
23                                     Q -> R
35                                     V ->
51                                     S -> F
90                                     I -> V
127                                    Q ->
146                                    S -> R
176                                    K ->
176                                    K -> E
181                                    D -> G
181                                    D -> V
189                                    F -> L
195                                    I -> T
196                                    L ->
202                                    T ->

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 29:

TABLE 29 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin C2 type    HMMSmart         34-123, 152-218
Immunoglobulin subtype    HMMSmart         28-137, 146-229
Immunoglobulin-like       HMMPfam          36-118, 154-213
Myelin P0 protein         FPrintScan       35-59, 110-139
Immunoglobulin V-type     HMMSmart         38-118
Immunoglobulin V-set      HMMPfam          21-137
Immunoglobulin-like       ProfileScan      36-120, 140-227

Variant protein AI581519_P10 (SEQ ID NO:16) is encoded by the transcript AI581519_T11 (SEQ ID NO:10), for which the coding portion starts at position 171 and ends at position 863. The transcript also has the following SNPs as listed in Table 30 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 30 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          238, 438, 696, 712, 889, 890, 2172, 2329
T ->            275, 757
C -> T          322, 452
C -> A          449, 608
C ->            549
C -> G          608
A ->            696, 776, 868, 994
A -> T          712
T -> C          735, 754, 1059
G -> A          1537, 1658, 1725
T -> A          2118

According to an optional embodiment of the present invention, short segments related to the above cluster are also provided. These segments are up to about 120 bp in length, and so are included in a separate description.

Segment cluster AI581519_N7 (SEQ ID NO:120) according to the present invention is supported by 6 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: AI581519_T5 (SEQ ID NO:6) and AI581519_T6 (SEQ ID NO:7). Table 31 below describes the starting and ending position of this segment on each transcript.

TABLE 31 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
AI581519_T5 (SEQ ID NO: 6)    384                          491
AI581519_T6 (SEQ ID NO: 7)    384                          491

Segment cluster AI581519_N9 (SEQ ID NO:121) according to the present invention is supported by 1 library. The number of libraries was determined as previously described. This segment can be found in the following transcript: AI581519_T6 (SEQ ID NO:7). Table 32 below describes the starting and ending position of this segment on the transcript.

TABLE 32 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
AI581519_T6 (SEQ ID NO: 7)    492                          569
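The segment locations of Tables 31 and 32 can be related to the unique regions of the variant proteins by the following sketch. It assumes 1-based, inclusive transcript positions and uses the coding start at position 171 that the text gives for AI581519_T5 and AI581519_T6. The function names are hypothetical.

```python
CDS_START = 171  # coding start of AI581519_T5 and AI581519_T6, from the text


def segment_length(start, end):
    """Length in bp of a 1-based, inclusive segment."""
    return end - start + 1


def residues_covered(start, end, cds_start=CDS_START):
    """First and last amino acid positions whose codons overlap the segment."""
    first = (start - cds_start) // 3 + 1
    last = (end - cds_start) // 3 + 1
    return first, last


if __name__ == "__main__":
    for name, (start, end) in {"AI581519_N7": (384, 491),
                               "AI581519_N9": (492, 569)}.items():
        print(name, segment_length(start, end), residues_covered(start, end))
```

Under these assumptions segment AI581519_N7 is 108 bp long and covers amino acids 72-107, the unique region of AI581519_P4, while segment AI581519_N9 is 78 bp long and covers amino acids 108-133, so that the two segments together cover the unique region 72-133 of AI581519_P5.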

Expression of V-set and immunoglobulin domain containing 1 (VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7 (SEQ ID NO:190) in normal and cancerous Ovary tissues, in normal and cancerous lung tissues and in different normal tissues

Expression of V-set and immunoglobulin domain containing 1 (VSIG1) transcripts detectable by or according to seg7—AI581519_seg7 (SEQ ID NO:190) amplicon and primers AI581519_seg7F1 (SEQ ID NO: 188) and AI581519_seg7R1 (SEQ ID NO: 189) was measured by real time PCR on ovary panel, lung panel and normal panel. The samples used are detailed in Table 4, Table 3 and Table 2 above, respectively.

Ovary panel: Non-detected samples (samples no. 80, 83, 100 and 109, Table 4) were assigned a Ct value of 41 and were calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to the median of the normal samples.
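The fold up-regulation calculation described above can be sketched as follows. The sketch starts from quantities that have already been normalized to the housekeeping-gene factor of Example 1 and only illustrates the division by the median of the normal samples. The sample identifiers and quantities in the usage example are hypothetical and are not data from Table 4; the function name is likewise an assumption.

```python
from statistics import median


def fold_upregulation(normalized_quantities, normal_ids):
    """Divide every normalized quantity by the median of the normal samples."""
    baseline = median(normalized_quantities[s] for s in normal_ids)
    if baseline == 0:
        raise ValueError("median of the normal samples is zero")
    return {s: q / baseline for s, q in normalized_quantities.items()}


if __name__ == "__main__":
    # Hypothetical normalized quantities: three normal samples and one tumor.
    quantities = {"N1": 1.0, "N2": 2.0, "N3": 3.0, "T1": 80.0}
    print(fold_upregulation(quantities, ["N1", "N2", "N3"]))
```

In this hypothetical example the median of the normal samples is 2.0, so the tumor sample is reported at 40-fold.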

FIG. 4 is a histogram showing over expression of the above-indicated V-set and immunoglobulin domain containing 1 (VSIG1) transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 4, the expression of V-set and immunoglobulin domain containing 1 (VSIG1) transcripts detectable by the above amplicon in mucinous carcinoma and endometrioid samples was significantly higher than in the non-cancerous samples (sample numbers 52-78, Table 4 above). Notably, an over-expression of at least 40 fold was found in 4 out of 12 mucinous carcinoma samples and in 3 out of 10 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below. A threshold of 40 fold over-expression was found to differentiate mucinous carcinoma samples and endometrioid samples from normal samples, with P values of 6.02e-003 and 1.54e-002, respectively, as checked by exact Fisher test. The above values demonstrate statistical significance of the results.
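The threshold comparison by exact Fisher test can be sketched as follows. The text gives the number of cancer samples at or above the threshold but not the corresponding number of normal samples; the sketch assumes that none of the 27 normal samples (sample numbers 52-78) reached the 40 fold threshold and that a one-sided test was used. Both are assumptions, although under them the calculation gives approximately 6.02e-003 and 1.54e-002, in agreement with the values reported above. The function name is hypothetical.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided exact Fisher P value for the 2x2 table [[a, b], [c, d]].

    a: cancer samples at or above the threshold, b: cancer samples below it,
    c: normal samples at or above the threshold, d: normal samples below it.
    Returns the hypergeometric probability of observing a or more cancer
    samples at or above the threshold when all margins are held fixed.
    """
    row1, col1, total = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    tail = sum(
        comb(row1, k) * comb(total - row1, col1 - k) for k in range(a, upper + 1)
    )
    return tail / comb(total, col1)


# Assumption: 0 of the 27 normal samples reach the 40 fold threshold.
p_mucinous = fisher_one_sided(4, 8, 0, 27)      # 4 out of 12 mucinous carcinoma
p_endometrioid = fisher_one_sided(3, 7, 0, 27)  # 3 out of 10 endometrioid
print(f"{p_mucinous:.2e} {p_endometrioid:.2e}")
```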

Lung panel—For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70 Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 5 is a histogram showing over expression of the above-indicated V-set and immunoglobulin domain containing 1 (VSIG1) transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 5, the expression of V-set and immunoglobulin domain containing 1 (VSIG1) transcripts detectable by the above amplicon in adenocarcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above). Notably an over-expression of at least 6 fold was found in 11 out of 23 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of V-set and immunoglobulin domain containing 1 (VSIG1) transcripts detectable by the above amplicon in Lung adenocarcinoma samples versus the normal tissue samples was determined by T test as 9.36e-003. Threshold of 6 fold over expression was found to differentiate between adenocarcinoma and normal samples with P value of 8.07e-004 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Normal panel—For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31, 32, 33 and 34, Table 2 above), to obtain a value of relative expression of each sample relative to median of the ovary samples, as shown in FIG. 6A. The normalized quantity of each RT sample was also divided by the median of the quantities of the lung samples (sample numbers 26, 28, 29 and 30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 6B.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI581519_seg7F1 (SEQ ID NO: 188) forward primer; and AI581519_seg7R1 (SEQ ID NO: 189) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI581519_seg7 (SEQ ID NO:190).

Forward Primer >AI581519_seg7F1 (SEQ ID NO: 188):
CACAGCTCGTGCCTCAGTACTG
Reverse Primer >AI581519_seg7R1 (SEQ ID NO: 189):
AGCTACATCTTCCCCGAGCG
Amplicon >AI581519_seg7 (SEQ ID NO: 190):
CACAGCTCGTGCCTCAGTACTGAGGGTATGGAGGAAAAG
GCAGTCGGTCAGTGTCTAAAAATGACGCACGTAAGAGAC
GCTCGGGGAAGATGTAGCT
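The relationship between a primer pair and its amplicon can be checked with a short sketch such as the following, which tests that the amplicon begins with the forward primer and ends with the reverse complement of the reverse primer. The sequences are those listed above for SEQ ID NOs: 188-190; the function names are hypothetical and the check is illustrative only.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def primers_flank_amplicon(forward, reverse, amplicon):
    """True if the amplicon starts with the forward primer and ends with the
    reverse complement of the reverse primer."""
    return amplicon.startswith(forward) and amplicon.endswith(
        reverse_complement(reverse)
    )


forward = "CACAGCTCGTGCCTCAGTACTG"  # AI581519_seg7F1 (SEQ ID NO: 188)
reverse = "AGCTACATCTTCCCCGAGCG"    # AI581519_seg7R1 (SEQ ID NO: 189)
amplicon = (                        # AI581519_seg7 (SEQ ID NO: 190)
    "CACAGCTCGTGCCTCAGTACTGAGGGTATGGAGGAAAAG"
    "GCAGTCGGTCAGTGTCTAAAAATGACGCACGTAAGAGAC"
    "GCTCGGGGAAGATGTAGCT"
)

print(primers_flank_amplicon(forward, reverse, amplicon))  # True
```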

Expression of V-set and immunoglobulin domain containing 1(VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_seg7-9 (SEQ ID NO: 187) in normal and cancerous ovary tissues and in normal and cancerous lung tissues

Expression of V-set and immunoglobulin domain containing 1(VSIG1) transcripts detectable by or according to seg7-9—AI581519_seg7-9 (SEQ ID NO: 187) amplicon and primers AI581519_seg7-9F1 (SEQ ID NO: 185) and AI581519_seg7-9R1 (SEQ ID NO: 186) was measured by real time PCR on ovary panel and lung panel. The samples used are detailed in Table 4 and Table 3 above, respectively.

Ovary panel—Non-detected sample (sample no. 40 Table 4) was assigned Ct value of 41 and was calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 7 is a histogram showing over expression of the above-indicated V-set and immunoglobulin domain containing 1(VSIG1) transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 7, the expression of V-set and immunoglobulin domain containing 1(VSIG1) transcripts detectable by the above amplicon in mucinous carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 52-78, Table 4 above) and was higher in a few adenocarcinoma samples than in the non-cancerous samples. Notably, an over-expression of at least 6 fold was found in 6 out of 9 mucinous carcinoma samples and in 9 out of 37 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. Threshold of 6 fold over expression was found to differentiate between mucinous carcinoma and normal samples with P value of 2.75e-004 as checked by exact Fisher test. Threshold of 6 fold over expression was found to differentiate between adenocarcinoma and normal samples with P value of 2.44e-002 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Lung panel—For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 8 is a histogram showing over expression of the above-indicated V-set and immunoglobulin domain containing 1(VSIG1) transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 8, the expression of V-set and immunoglobulin domain containing 1(VSIG1) transcripts detectable by the above amplicon in adenocarcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above) and was higher in a few non-small cell carcinoma samples than in the non-cancerous samples. Notably an over-expression of at least 17 fold was found in 8 out of 18 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of V-set and immunoglobulin domain containing 1(VSIG1) transcripts detectable by the above amplicon in Lung adenocarcinoma samples versus the normal tissue samples was determined by T test as 1.17e-002.

Threshold of 17 fold over expression was found to differentiate between adenocarcinoma and normal samples with P value of 2.41e-003 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI581519_seg7-9F1 (SEQ ID NO: 185) forward primer; and AI581519_seg7-9R1 (SEQ ID NO: 186) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI581519_seg7-9 (SEQ ID NO: 187).

Forward Primer >AI581519_seg7-9F1 (SEQ ID NO: 185):
AATGACGCACGTAAGAGACGC
Reverse Primer >AI581519_seg7-9R1 (SEQ ID NO: 186):
GAGTGCCCTTCACAGCCTCA
Amplicon >AI581519_seg7-9 (SEQ ID NO: 187):
AATGACGCACGTAAGAGACGCTCGGGGAAGATGTAGC
TGGACCTCTGAGTCTCCTTGGGAGGAGGGGAAGTGGC
CAGATGTTGAGGCTGTGAAGGGCACTC

Expression of V-set and immunoglobulin domain containing 1(VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519seg7-9F3R3 (SEQ ID NO: 196) in the blood-specific panel.

Expression of VSIG1 transcripts detectable by or according to seg7-9-AI581519seg7-9F3R3 (SEQ ID NO:196) amplicon and primers AI581519seg7-9F3 (SEQ ID NO:194) and AI581519seg7-9R3 (SEQ ID NO:195) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. Non-detected samples (samples no. 28, 33, 83, 85, 90 and 63, Table 1) were assigned Ct value of 41 and were calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 64, 69-72 and 74-76, Table 1 above), to obtain a value of relative expression of each sample relative to median of the normal samples.

The results of this analysis are depicted in the histogram in FIG. 9. Expression of the above-indicated VSIG1 transcript is high in CD8, CD4 untreated and CD4+CD25− samples but even higher in normal small intestine and normal stomach samples.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: seg7-9F3 forward primer (SEQ ID NO:194); and seg7-9R3 reverse primer (SEQ ID NO:195).

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: seg7-9F3R3 (SEQ ID NO:196).

Forward Primer >AI581519seg7-9F3 (SEQ ID NO: 194):
ATGACGCACGTAAGAGACGCTCG
Reverse Primer >AI581519seg7-9R3 (SEQ ID NO: 195):
GGAGTTCAGCCTGCTGTCCATCAAG
Amplicon >AI581519seg7-9F3R3 (SEQ ID NO: 196):
ATGACGCACGTAAGAGACGCTCGGGGAAGATGTAGCTGGA
CCTCTGAGTCTCCTTGGGAGGAGGGGAAGTGGCCAGATGT
TGAGGCTGTGAAGGGCACTCTTGATGGACAGCAGGCTGAA
CTCC

Expression of V-set and immunoglobulin domain containing 1(VSIG1) AI581519 transcripts which are detectable by amplicon as depicted in sequence name AI581519_junc7-11F2R2 (SEQ ID NO:193) in normal and cancerous lung tissues, normal and cancerous ovary tissues, different normal tissues and blood-specific panel.

Expression of VSIG1 transcripts detectable by or according to junc7-11F2R2-AI581519_junc7-11F2R2 (SEQ ID NO:193) amplicon and primers AI581519_junc7-11F2 (SEQ ID NO:191) and AI581519_junc7-11R2 (SEQ ID NO:192) was measured by real time PCR on lung panel, ovary panel, normal panel and blood panel. The samples used are detailed in Table 3, Table 4, Table 2 and Table 1 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

For lung panel—Non-detected sample (sample no. 49, Table 3) was assigned Ct value of 41 and was calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (samples numbers 51-64, 69 and 70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 10 is a histogram showing over expression of the above-indicated VSIG1 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 10, the expression of VSIG1 transcripts detectable by the above amplicon in adenocarcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64, 69 and 70, Table 3 above). Notably an over-expression of at least 16 fold was found in 10 out of 23 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of VSIG1 transcripts detectable by the above amplicon in lung adenocarcinoma samples versus the normal tissue samples was determined by T test as 5.13e-003.

Threshold of 16 fold over expression was found to differentiate between adenocarcinoma and normal samples with P value of 1.80e-003 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

For ovary panel—Non-detected samples (samples no. 16, 23, 57, 60-62, 67, 68, 71-74, 77 and 78, Table 4) were assigned Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52, 53, 55 and 57-67, Table 4 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 11 is a histogram showing over expression of the above-indicated VSIG1 transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 11, the expression of VSIG1 transcripts detectable by the above amplicon in mucinous carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 52, 53, 55 and 57-67, Table 4 above) and was higher in a few adenocarcinoma samples than in the non-cancerous samples. Notably an over-expression of at least 25 fold was found in 6 out of 9 mucinous carcinoma samples and in 10 out of 37 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. A threshold of 25 fold over-expression was found to differentiate mucinous carcinoma samples and adenocarcinoma samples from normal samples, with P values of 3.95e-004 and 1.88e-002, respectively, as checked by exact Fisher test. The above values demonstrate statistical significance of the results.

For normal panel—Non-detected samples (samples no. 11-20, 28, 30, 32-34, 36, 38-40, 49 and 56, Table 2) were assigned Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the lung samples (sample numbers 26, and 28-30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 12A. The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31-34, Table 2 above), to obtain a value of relative expression of each sample relative to median of the ovary samples, as shown in FIG. 12B.

For blood panel—Non-detected samples (samples no. 6, 15-19, 22-24, 27, 29, 40, 41, 46-50, 52-58, 60-64, 71 and 76, Table 1) were assigned Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the normal samples.

The results of this analysis are depicted in the histogram in FIG. 13. Expression of the above-indicated VSIG1 transcript is very high in CD8, CD4 untreated and CD4+CD25− samples, is high in several lymphomas but also very high in normal small intestine and normal stomach samples.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI581519_junc7-11F2 (SEQ ID NO:191) forward primer; and AI581519_junc7-11R2 (SEQ ID NO:192) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI581519_junc7-11F2R2 (SEQ ID NO:193).

Forward Primer >AI581519_junc7-11F2 (SEQ ID NO: 191):
GAAGATGTAGCTGGACCTCTGAGATTTA
Reverse Primer >AI581519_junc7-11R2 (SEQ ID NO: 192):
GTTGGACCCTGTAATTCGATCTTT
Amplicon >AI581519_junc7-11F2R2 (SEQ ID NO: 193):
GAAGATGTAGCTGGACCTCTGAGATTTACTTTTCTCAAG
GTGGACAAGCTGTAGCCATCGGGCAATTTAAAGATCGAA
TTACAGGGTCCAAC

Example 3 Description for Cluster AA424839

The present invention relates to ILDR1 polypeptides, novel splice variants and diagnostics and therapeutics based thereon.

According to the present invention, Cluster AA424839 (internal ID 71418261) features 4 transcripts and 1 segment of interest, the names for which are given in Tables 33 and 34, respectively. The selected protein variants are given in Table 35.

TABLE 33 Transcripts of interest
Transcript Name
AA424839_T0 (SEQ ID NO: 17)
AA424839_T2 (SEQ ID NO: 18)
AA424839_T4 (SEQ ID NO: 19)
AA424839_1_T7 (SEQ ID NO: 20)

TABLE 34 Segments of interest
Segment Name
AA424839_N18 (SEQ ID NO: 122)

TABLE 35 Proteins of interest
Protein Name | Corresponding Transcripts
AA424839_P3 (SEQ ID NO: 22) | AA424839_T0 (SEQ ID NO: 17)
AA424839_P5 (SEQ ID NO: 21) | AA424839_T2 (SEQ ID NO: 18)
AA424839_P7 (SEQ ID NO: 23) | AA424839_T4 (SEQ ID NO: 19)
AA424839_1_P11 (SEQ ID NO: 24) | AA424839_1_T7 (SEQ ID NO: 20)

These sequences are variants of the known protein immunoglobulin-like domain containing receptor 1 (RefSeq accession identifier NP_787120 (SEQ ID NO: 21), also known as ILDR1alpha, ILDR1beta, MGC50831), referred to herein as the previously known protein.

ILDR1, denoted immunoglobulin-like domain containing receptor 1 (SEQ ID NO:21), was described by Hauge et al. (2004) BBRC 323: 970-978, who demonstrated differential expression of the transcripts encoding this protein in indolent follicular lymphoma (FL) and matched transformed diffuse large B cell lymphoma (DLBCL). The gene was identified using a cDNA subtraction strategy on patient-matched biopsies of FL and DLBCL. The protein was shown to contain a signal peptide, a transmembrane domain and an Ig domain in the extracellular portion, and it was found to be a membrane-bound protein having 31% identity to lipolysis-stimulated remnant receptor (LSR).

According to the present invention, ILDR1 protein and ILDR1 splice variants were predicted to be novel B7/CD28 members. According to the present invention, ILDR1 and ILDR1 splice variants were demonstrated to be overexpressed in ovarian cancer.

The MED discovery engine described in Example 1 herein was used to assess the expression of ILDR1 transcripts. Expression data for Affymetrix probe set 235583_at, representing the ILDR1 gene, are shown in FIGS. 14 and 15. As evident from the scatter plot presented in FIG. 14, the expression of ILDR1 transcripts detectable with the above probe set was higher in ovarian cancer compared to normal ovary samples. As evident from the scatter plot presented in FIG. 15, the expression of ILDR1 transcripts detectable with the above probe set was higher in colon cancer compared to normal colon samples.

As noted above, cluster AA424839 features 4 transcripts, which were listed in Table 33 above. These transcripts encode for proteins which are variants of protein immunoglobulin-like domain containing receptor 1 (SEQ ID NO:21). A description of each protein according to the present invention is now provided.

Variant protein AA424839_P3 (SEQ ID NO:22) according to the present invention has an amino acid sequence as encoded by transcript AA424839_T0 (SEQ ID NO:17). Alignments to previously published protein sequences are shown in FIG. 16A. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between AA424839_P3 (SEQ ID NO:22) and known proteins Q86SU0_HUMAN (SEQ ID NO: 21) and NP_787120 (SEQ ID NO: 21) (FIG. 16A):

A. An isolated chimeric polypeptide encoding for AA424839_P3 (SEQ ID NO:22), comprising a first amino acid sequence being at least 90% homologous to

MAWPKLPAPWLLLCTWLPAGCLSLLVTVQHTERYVTLFASIILKCDYTTS AQLQDVVVTWRFKSFCKDPIFDYYSASYQAALSLGQDPSNDCNDNQREVR IVAQRRGQNEPVLGVDYRQRKITIQNRADLVINEVMWWDHGVYYCTIEAP GDTSGDPDKEVKLIVLHWLTVIFIILGALLLLLLIGVCWCQCCPQYCCCY IRCPCCPAHCCCPEE corresponding to amino acids 1-215 of known proteins Q86SU0_HUMAN (SEQ ID NO: 21) and NP_787120 (SEQ ID NO: 21), which also corresponds to amino acids 1-215 of AA424839_P3 (SEQ ID NO:22), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence

(SEQ ID NO: 288) ALARHRYMKQAQALGPQMMGKPLYWGADRSSQVSSYPMHPLLQR corresponding to amino acids 216-259 of AA424839_P3 (SEQ ID NO:22), and a third amino acid sequence being at least 90% homologous to

DLSLPSSLPQMPMTQTTNQPPIANGVLEYLEKELRNLNLAQPLPPDLKGR FGHPCSMLSSLGSEVVERRIIHLPPLIRDLSSSRRTSDSLHQQWLTPIPS RPWDLREGRSHHHYPDFHQELQDRGPKSWALERRELDPSWSGRHRSSRLN GSPIHWSDRDSLSDVPSSSEARWRPSHPPFRSRCQERPRRPSPRESTQRH GRRRRHRSYSPPLPSGLSSWSSEEDKERQPQSWRAHRRGSHSPHWPEEKP PSYRSLDITPGKNSRKKGSVERRSEKDSSHSGRSVVI corresponding to amino acids 216-502 of known proteins Q86SU0_HUMAN (SEQ ID NO: 21) and NP_787120 (SEQ ID NO: 21), which also corresponds to amino acids 260-546 of AA424839_P3 (SEQ ID NO:22), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of AA424839_P3 (SEQ ID NO:22), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the

(SEQ ID NO: 288) ALARHRYMKQAQALGPQMMGKPLYWGADRSSQVSSYPMHPLLQR of AA424839_P3 (SEQ ID NO:22).
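The coordinates given in the comparison report above follow from concatenating three contiguous segments. The following minimal sketch derives the 1-based, inclusive positions on the variant protein from the segment lengths alone; the function name is hypothetical and the lengths are taken from the amino acid ranges stated above.

```python
def segment_positions(lengths):
    """Return 1-based inclusive (start, end) positions of contiguous segments."""
    positions, start = [], 1
    for length in lengths:
        positions.append((start, start + length - 1))
        start += length
    return positions


# AA424839_P3: amino acids 1-215 of the known protein, the 44-residue
# unique sequence (SEQ ID NO: 288), then amino acids 216-502 of the known protein.
p3 = segment_positions([215, 44, 502 - 216 + 1])
print(p3)  # [(1, 215), (216, 259), (260, 546)]
```

The last end position, 546, is also the length implied by the coding portion of transcript AA424839_T0 if positions 204 to 1841 are read as 546 complete codons, which is an interpretation rather than a statement made in the text.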

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AA424839_P3 (SEQ ID NO:22) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 36, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:22)).

TABLE 36 Amino acid mutations
SNP positions on amino acid sequence | Alternative amino acids
165 | V -> F
165 | V -> L
264 | P -> R
388 | W -> L
388 | W -> S
436 | H -> N
500 | H -> L
500 | H -> P
516 | D -> Y

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 37.

TABLE 37 InterPro domains
Domain description | Analysis type | Positions on protein
Phospholipase A2 | ScanRegExp | 205-212
Immunoglobulin subtype | HMMSmart | 30-166

Variant protein AA424839_P3 (SEQ ID NO:22) is encoded by the transcript AA424839_T0 (SEQ ID NO:17), for which the coding portion starts at position 204 and ends at position 1841. The transcript also has the following SNPs as listed in Table 38 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 38 Nucleic acid SNPs
Polymorphism | SNP positions on nucleotide sequence
G -> T | 696, 1366, 1748, 1749
G -> C | 696, 995, 1366
C -> G | 994, 1529, 2305
A -> T | 1442, 1702
A -> C | 1442, 1702
G -> A | 1502
C -> A | 1509, 1529
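The nucleotide positions of Table 38 can be related to the amino acid positions of Table 36 through the start of the coding portion. The following sketch assumes 1-based, inclusive coordinates with the first codon beginning at position 204; the function name is hypothetical. Under this assumption, position 696 falls in codon 165 and position 994 in codon 264, in agreement with Table 36, while position 2305 lies outside the coding portion.

```python
def codon_position(nt_pos, cds_start, cds_end):
    """Map a 1-based transcript position to (amino acid number, position in codon).

    Returns None when the position lies outside the coding portion.
    """
    if not cds_start <= nt_pos <= cds_end:
        return None
    offset = nt_pos - cds_start
    return offset // 3 + 1, offset % 3 + 1


CDS_START, CDS_END = 204, 1841  # coding portion of AA424839_T0 (SEQ ID NO: 17)

for nt in (696, 994, 1366, 1509, 1702, 1749, 2305):
    print(nt, codon_position(nt, CDS_START, CDS_END))
```

Positions that map to the third base of a codon, such as 995, are candidates for silent changes, which would explain why they have no counterpart among the non-silent SNPs of Table 36; this reading is an inference and is not stated in the text.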

Protein AA424839_P5 (SEQ ID NO:21) according to the present invention has an amino acid sequence as encoded by transcript AA424839_T2 (SEQ ID NO:18).

The localization of the protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The protein is believed to be located as follows with regard to the cell: membrane.

Protein AA424839_P5 (SEQ ID NO:21) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 39, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:21)).

TABLE 39 Amino acid mutations
SNP positions on amino acid sequence | Alternative amino acids
165 | V -> F
165 | V -> L
220 | P -> R
344 | W -> L
344 | W -> S
392 | H -> N
456 | H -> L
456 | H -> P
472 | D -> Y

The protein has the following domains, as determined by using InterPro. The domains are described in Table 40:

TABLE 40 InterPro domains
Domain description | Analysis type | Positions on protein
Phospholipase A2 | ScanRegExp | 205-212
Immunoglobulin subtype | HMMSmart | 30-166

Protein AA424839_P5 (SEQ ID NO:21) is encoded by the transcript AA424839_T2 (SEQ ID NO:18), for which the coding portion starts at position 204 and ends at position 1709. The transcript also has the following SNPs as listed in Table 41 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 41 Nucleic acid SNPs
Polymorphism | SNP positions on nucleotide sequence
G -> T | 696, 1234, 1616, 1617
G -> C | 696, 863, 1234
C -> G | 862, 1397, 2173
A -> T | 1310, 1570
A -> C | 1310, 1570
G -> A | 1370
C -> A | 1377, 1397

The genomic structure of protein AA424839_P5 (SEQ ID NO:21) (number of exons relevant to the extra-cellular region of the protein, the length of these exons, the frame of the codon in which the introns are inserted and the location of the protein features and domains in the gene structure) is characteristic of the ligands of the B7/co-stimulatory protein family, as given in Table 42.

TABLE 42 Genomic structure and protein features
Exon number | Exon length | Amino acids | Protein feature on exon
1 | 58 | 1-19 | Signal Peptide
2 | 171 | 20-76 | Ig Domain
3 | 150 | 77-126 | Ig Domain
4 | 120 | 127-166 | Ig Domain/Trans-membrane region
5 | 147 | 167-215 | Trans-membrane region
6 | 821 | 216-489 |
7 | 39 | 490-502 |
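The frame of the codon in which each intron is inserted can be derived from the exon lengths of Table 42, as in the following sketch. It assumes that the listed lengths count coding nucleotides only, beginning with the first nucleotide of the start codon; this is an assumption, although the lengths do sum to 1506 nucleotides, which equals the length of the coding portion of AA424839_T2 (positions 204 to 1709).

```python
from itertools import accumulate

# Exon lengths for exons 1-7 as listed in Table 42.
exon_lengths = [58, 171, 150, 120, 147, 821, 39]

# Running total of coding nucleotides at the end of each exon.
cumulative = list(accumulate(exon_lengths))

# Phase of the intron that follows each exon: the number of nucleotides of a
# split codon left at the end of the exon (0 means the intron lies between codons).
phases = [total % 3 for total in cumulative[:-1]]

print(phases)          # [1, 1, 1, 1, 1, 0]
print(cumulative[-1])  # 1506 nucleotides, i.e. 502 codons
```

Under this assumption, the introns bounding the exons that encode the signal peptide, the Ig domain and the trans-membrane region all interrupt a codon after its first nucleotide.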

Variant protein AA424839_P7 (SEQ ID NO:23) according to the present invention has an amino acid sequence as encoded by transcript AA424839_T4 (SEQ ID NO:19). Alignments to one or more previously published protein sequences are shown in FIG. 16B. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between AA424839_P7 (SEQ ID NO:23) and known proteins Q86SU0_HUMAN and NP_787120 (SEQ ID NO: 21) (FIG. 16B):

A. An isolated chimeric polypeptide encoding for AA424839_P7 (SEQ ID NO:23), comprising a first amino acid sequence being at least 90% homologous to

MAWPKLPAPWLLLCTWLPAGCLSLLVTVQHTERYVTLFASIILKCDYTTS AQLQDVVVTWRFKSFCKDPIFDYYSASYQAALSLGQDPSNDCNDNQREVR IVAQRRGQNEPVLGVDYRQRKITIQN corresponding to amino acids 1-126 of known proteins Q86SU0_HUMAN and NP_787120 (SEQ ID NO: 21), which also corresponds to amino acids 1-126 of AA424839_P7 (SEQ ID NO:23), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99%, homologous to a polypeptide having the sequence PLARHRYMKQAQALGPQMMGKPLYWGADRSSQVSSYPMHPLLQR corresponding to amino acids 127-170 of AA424839_P7 (SEQ ID NO:23), and a third amino acid sequence being at least 90% homologous to

DLSLPSSLPQMPMTQTTNQPPIANGVLEYLEKELRNLNLAQPLPPDLKGR FGHPCSMLSSLGSEVVERRIIHLPPLIRDLSSSRRTSDSLHQQWLTPIPS RPWDLREGRSHHHYPDFHQELQDRGPKSWALERRELDPSWSGRHRSSRLN GSPIHWSDRDSLSDVPSSSEARWRPSHPPFRSRCQERPRRPSPRESTQRH GRRRRHRSYSPPLPSGLSSWSSEEDKERQPQSWRAHRRGSHSPHWPEEKP PSYRSLDITPGKNSRKKGSVERRSEKDSSHSGRSVVI corresponding to amino acids 216-502 of known proteins Q86SU0_HUMAN and NP_787120 (SEQ ID NO: 21), which also corresponds to amino acids 171-457 of AA424839_P7 (SEQ ID NO:23), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: secreted.

Variant protein AA424839_P7 (SEQ ID NO:23) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 43, (given according to their positions on the amino acid sequence, with the alternative amino acids listed).

TABLE 43 Amino acid mutations
SNP positions on amino acid sequence | Alternative amino acids
175 | P -> R
299 | W -> L
299 | W -> S
347 | H -> N
411 | H -> L
411 | H -> P
427 | D -> Y

Variant protein AA424839_P7 (SEQ ID NO:23) is encoded by the transcript AA424839_T4 (SEQ ID NO:19), for which the coding portion starts at position 204 and ends at position 1574. The transcript also has the following SNPs as listed in Table 44 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 44 Nucleic acid SNPs
Polymorphism | SNP positions on nucleotide sequence
C -> G | 727, 1262, 2038
G -> C | 728, 1099
G -> T | 1099, 1481, 1482
A -> T | 1175, 1435
A -> C | 1175, 1435
G -> A | 1235
C -> A | 1242, 1262

Variant protein AA424839_1_P11 (SEQ ID NO:24) according to the present invention has an amino acid sequence as encoded by transcript AA424839_1_T7 (SEQ ID NO:20). Alignments to one or more previously published protein sequences are given in FIG. 16C.

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein AA424839_1_P11 (SEQ ID NO:24) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 45, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:24)).

TABLE 45 Amino acid mutations
SNP positions on amino acid sequence | Alternative amino acids
108 | V -> F
108 | V -> L
207 | P -> R
331 | W -> L
331 | W -> S
379 | H -> N
443 | H -> L
443 | H -> P
459 | D -> Y

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 46:

TABLE 46 InterPro domains
Domain description | Analysis type | Positions on protein
Phospholipase A2 | ScanRegExp | 148-155
Immunoglobulin subtype | HMMSmart | 30-109

Variant protein AA424839_1_P11 (SEQ ID NO:24) is encoded by the transcript AA424839_1_T7 (SEQ ID NO:20), for which the coding portion starts at position 204 and ends at position 1670. The transcript also has the following SNPs as listed in Table 47 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 47 Nucleic acid SNPs

Polymorphism    SNP positions on nucleotide sequence
G -> T          525, 1195, 1577, 1578
G -> C          525, 824, 1195
C -> G          823, 1358, 2134
A -> T          1271, 1531
A -> C          1271, 1531
G -> A          1331
C -> A          1338, 1358
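The correspondence between the nucleotide SNP positions of Table 47 and the amino acid positions of Table 45 follows from the coding start position (204) stated above. The following Python sketch is offered as a non-limiting illustration only; the function name codon_index is hypothetical and is not part of the original disclosure, and 1-based positions with an uninterrupted coding portion are assumed.

```python
def codon_index(nt_pos, cds_start, cds_end):
    """Return the 1-based codon (amino acid) index that contains a 1-based
    transcript position, assuming an uninterrupted coding portion."""
    if not cds_start <= nt_pos <= cds_end:
        raise ValueError("position lies outside the coding portion")
    return (nt_pos - cds_start) // 3 + 1


# Coding portion of AA424839_1_T7 as stated in the text: positions 204 to 1670.
CDS_START, CDS_END = 204, 1670

# Selected nucleotide SNP positions taken from Table 47.
for nt in (525, 823, 824, 1195, 1338, 1531, 1578):
    print(nt, "->", codon_index(nt, CDS_START, CDS_END))
```

Under these assumptions, nucleotide positions 525, 823 and 824, 1195, 1338, 1531 and 1578 fall in codons 108, 207, 331, 379, 443 and 459, respectively, which are the amino acid positions listed in Table 45. Position 2134 lies beyond the coding end position of 1670.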

Segment cluster AA424839_N18 (SEQ ID NO:122) according to the present invention is supported by 10 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: AA424839_T0 (SEQ ID NO:17), AA424839_T2 (SEQ ID NO:18), AA424839_T4 (SEQ ID NO:19) and AA424839_1_T7 (SEQ ID NO:20). Table 48 below describes the starting and ending position of this segment on each transcript.

TABLE 48 Segment location on transcripts

Transcript name                  Segment starting position    Segment ending position
AA424839_T0 (SEQ ID NO: 17)      1173                         1802
AA424839_T2 (SEQ ID NO: 18)      1041                         1670
AA424839_T4 (SEQ ID NO: 19)      906                          1535
AA424839_1_T7 (SEQ ID NO: 20)    1002                         1631
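Because the same segment is present in each transcript, the starting and ending positions of Table 48 should describe a stretch of identical length on every transcript. The short Python sketch below checks this; it is an illustrative consistency check only, and the names SEGMENT_POSITIONS and segment_length are hypothetical.

```python
# Start and end positions of segment AA424839_N18 on each transcript (Table 48).
SEGMENT_POSITIONS = {
    "AA424839_T0": (1173, 1802),
    "AA424839_T2": (1041, 1670),
    "AA424839_T4": (906, 1535),
    "AA424839_1_T7": (1002, 1631),
}


def segment_length(start, end):
    """Length of a segment given 1-based inclusive coordinates."""
    return end - start + 1


lengths = {name: segment_length(*pos) for name, pos in SEGMENT_POSITIONS.items()}
# A shared segment is expected to have one length across all transcripts.
assert len(set(lengths.values())) == 1, lengths
print(lengths)
```

With the positions of Table 48, the segment is 630 nucleotides long on each of the four transcripts.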

Expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg18wt (SEQ ID NO:199) in normal and cancerous Ovary tissues and in different normal tissues

Expression of immunoglobulin-like domain containing receptor 1 (ILDR1) transcripts detectable by or according to the seg18wt amplicon AA424839_seg18wt (SEQ ID NO:199) and primers AA424839_seg18wtF1 (SEQ ID NO:197) and AA424839_seg18wtR1 (SEQ ID NO:198) was measured by real time PCR on ovary panel and normal panel. The samples used are detailed in Table 4 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

Ovary panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.
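The fold up-regulation calculation described above can be sketched as follows in Python. This is a minimal illustration under stated assumptions: the function name fold_up_regulation is hypothetical, and the quantities shown are invented placeholder values, not measurements from the ovary panel.

```python
from statistics import median


def fold_up_regulation(normalized, normal_ids):
    """Divide each normalized quantity by the median of the normal samples."""
    normal_median = median(normalized[i] for i in normal_ids)
    return {i: q / normal_median for i, q in normalized.items()}


# Hypothetical normalized quantities keyed by sample number (not panel data).
quantities = {1: 50.0, 2: 120.0, 52: 2.0, 53: 4.0, 54: 6.0}
folds = fold_up_regulation(quantities, normal_ids={52, 53, 54})

# Samples reaching a chosen threshold, for example 25-fold.
over_threshold = sorted(i for i, f in folds.items() if f >= 25)
print(folds, over_threshold)
```

The median is used as the reference so that a single unusually high or low normal sample has little influence on the fold values.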

FIG. 17 is a histogram showing over expression of the above-indicated immunoglobulin-like domain containing receptor 1 (ILDR1) transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 17, the expression of immunoglobulin-like domain containing receptor 1 (ILDR1) transcripts detectable by the above amplicon in serous carcinoma, mucinous carcinoma and adenocarcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 52-78, Table 4 above). Notably, over-expression of at least 25-fold was found in 36 out of 37 adenocarcinoma samples: specifically, in 17 out of 18 serous carcinoma samples, in 9 out of 9 mucinous carcinoma samples and in 10 out of 10 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of immunoglobulin-like domain containing receptor 1 (ILDR1) transcripts detectable by the above amplicon in Ovary adenocarcinoma, serous carcinoma, mucinous carcinoma and endometrioid samples versus the normal tissue samples was determined by T test as 3.85e-010, 6.21e-005, 1.10e-003 and 2.94e-004, respectively.

A threshold of 25-fold over-expression was found to differentiate adenocarcinoma, serous carcinoma, mucinous carcinoma and endometrioid samples from normal samples with P values of 3.31e-017, 1.63e-011, 1.06e-008 and 2.87e-009, respectively, as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.
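The exact Fisher test on a fold threshold can be illustrated with the Python sketch below, which computes a one-sided hypergeometric tail for a 2x2 table of samples above and below the threshold. The function name fisher_exact_one_sided is hypothetical. The text does not state how many normal samples exceeded the threshold; the sketch assumes, purely for illustration, that none of the 27 normal samples (sample numbers 52-78) did.

```python
from math import comb


def fisher_exact_one_sided(a, b, c, d):
    """One-sided (greater) Fisher exact P value for the 2x2 table
    [[a, b], [c, d]]: the probability of observing a or more in the
    top-left cell when all row and column totals are held fixed."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1))
    return tail / total


# Adenocarcinoma: 36 of 37 samples at or above 25-fold (from the text).
# Normal: 0 of 27 samples at or above 25-fold (ASSUMPTION, not stated in the text).
p_adeno = fisher_exact_one_sided(36, 1, 0, 27)
print(f"{p_adeno:.2e}")
```

Under that assumption the sketch returns approximately 3.3e-17 for the adenocarcinoma comparison.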

Normal panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31, 32, 33 and 34, Table 2 above), to obtain a value of relative expression of each sample relative to median of the ovary samples, as shown in FIG. 18.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AA424839_seg18wtF1 (SEQ ID NO:197) forward primer; and AA424839_seg18wtR1 (SEQ ID NO:198) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AA424839_seg18wt (SEQ ID NO:199).

Forward Primer >AA424839_seg18wtF1 (SEQ ID NO: 197)
AGCCACCACCATTACCCTGA
Reverse Primer >AA424839_seg18wtR1 (SEQ ID NO: 198)
TGCCTTCCACTCCACGATG
Amplicon >AA424839_seg18wt (SEQ ID NO: 199)
AGCCACCACCATTACCCTGATTTCCACCAGGAGCTCCA
GGACCGGGGGCCAAAGTCTTGGGCATTGGAAAGAAGGG
AGTTGGACCCATCGTGGAGTGGAAGGCA
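The relationship between the primer pair and the amplicon listed above can be checked with a short Python sketch: the amplicon is expected to begin with the forward primer and to end with the reverse complement of the reverse primer. The function names reverse_complement and primers_flank are hypothetical and serve only as a non-limiting illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def primers_flank(amplicon, forward, reverse):
    """True if the amplicon starts with the forward primer and ends with
    the reverse complement of the reverse primer."""
    return amplicon.startswith(forward) and amplicon.endswith(reverse_complement(reverse))


# Sequences as listed for SEQ ID NOs: 197, 198 and 199.
forward = "AGCCACCACCATTACCCTGA"
reverse = "TGCCTTCCACTCCACGATG"
amplicon = (
    "AGCCACCACCATTACCCTGATTTCCACCAGGAGCTCCA"
    "GGACCGGGGGCCAAAGTCTTGGGCATTGGAAAGAAGGG"
    "AGTTGGACCCATCGTGGAGTGGAAGGCA"
)

print(primers_flank(amplicon, forward, reverse))
```

For the sequences listed above the check succeeds, the reverse complement of the reverse primer being CATCGTGGAGTGGAAGGCA.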

Expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg14-16 (SEQ ID NO: 202) in normal and cancerous ovary tissues or different normal tissues

Expression of ILDR1 transcripts detectable by or according to the seg14-16 amplicon AA424839_seg14-16 (SEQ ID NO: 202) and primers AA424839_seg14-16F1 (SEQ ID NO:200) and AA424839_seg14-16R1 (SEQ ID NO:201) was measured by real time PCR on ovary panel or normal panel. The samples used are detailed in Table 4 and in Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

For ovary panel—the normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-55, 58, 59, 63-69 and 71-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 19 is a histogram showing over expression of the above-indicated ILDR1 transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 19, the expression of ILDR1 transcripts detectable by the above amplicon in serous carcinoma, mucinous carcinoma and adenocarcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 52-55, 58, 59, 63-69 and 71-78, Table 4 above). Notably, over-expression of at least 14-fold was found in 33 out of 37 adenocarcinoma samples: specifically, in 14 out of 18 serous carcinoma samples, in 9 out of 9 mucinous carcinoma samples and in 10 out of 10 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below.

The P value for the difference in the expression levels of ILDR1 transcripts detectable by the above amplicon in ovary adenocarcinoma samples, ovary serous carcinoma samples, ovary mucinous carcinoma samples and ovary endometrioid samples versus the normal tissue samples was determined by T test as 1.00e-008, 4.79e-004, 4.97e-004 and 6.93e-005, respectively.

A threshold of 14-fold over-expression was found to differentiate adenocarcinoma, serous carcinoma, mucinous carcinoma and endometrioid samples from normal samples with P values of 3.78e-012, 2.03e-007, 6.99e-008 and 2.25e-008, respectively, as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

For normal panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31-34, Table 2 above), to obtain a value of relative expression of each sample relative to median of the ovary samples.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AA424839_seg14-16F1 (SEQ ID NO:200) forward primer; and AA424839_seg14-16R1 (SEQ ID NO:201) reverse primer.

The results demonstrating the expression of ILDR1 AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg14-16 (SEQ ID NO: 202) in different normal tissues are presented in FIG. 20.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AA424839_seg14-16F1R1 (SEQ ID NO:202).

Forward Primer >AA424839_seg14-16F1 (SEQ ID NO: 200)
GCCACCGCTACATGAAGCA
Reverse Primer >AA424839_seg14-16R1 (SEQ ID NO: 201)
CTGGACGGCAGGGACAAAT
Amplicon >AA424839_seg14-16F1R1 (SEQ ID NO: 202)
GCCACCGCTACATGAAGCAGGCCCAGGCCCTAGGTCCT
CAGATGATGGGAAAACCCCTGTACTGGGGGGCGGACAG
GAGCTCCCAGGTTTCATCTTATCCAATGCACCCGCTGC
TGCAGCGAGATTTGTCCCTGCCGTCCAG

Expression of immunoglobulin-like domain containing receptor 1 (ILDR1) AA424839 transcripts which are detectable by amplicon as depicted in sequence name AA424839_seg11-14F3R3 (SEQ ID NO:205) in the blood-specific panel.

Expression of ILDR1 transcripts detectable by or according to the seg11-14 amplicon AA424839_seg11-14F3R3 (SEQ ID NO: 205) and primers AA424839_seg11-14F3 (SEQ ID NO: 203) and AA424839_seg11-14R3 (SEQ ID NO: 204) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 21. Expression of the above-indicated ILDR1 transcript was seen in several lymphomas and cell lines; however, the expression was as high as in kidney normal samples.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: seg11-14F3 forward primer; and seg11-14R3 reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: seg11-14F3R3.

Forward Primer > AA424839_seg11-14F3 (SEQ ID NO: 203)
TCCTCCTCCTGCTGCTGATTG
Reverse Primer > AA424839_seg11-14R3 (SEQ ID NO: 204)
TGGGCCTGCTTCATGTAGCG
Amplicon > AA424839_seg11-14F3R3 (SEQ ID NO: 205)
TCCTCCTCCTGCTGCTGATTGGAGTGTGCTGGTGCCAG
TGCTGTCCTCAGTATTGCTGCTGCTATATCCGCTGTCC
CTGCTGTCCTGCCCACTGCTGCTGTCCTGAGGAAGCCC
TGGCCCGCCACCGCTACATGAAGCAGGCCCA

Example 4 Description for Cluster AI216611

The present invention relates in particular to a putative B7/CD28 member referred to as AI216611 and diagnostics and therapeutics based thereon. According to the present invention, Cluster AI216611 (internal ID 70605934) features 2 transcripts and 3 segments of interest, the names for which are given in Tables 49 and 50, respectively. The selected proteins are given in Table 51.

TABLE 49 Transcripts of interest

Transcript Name
AI216611_T0 (SEQ ID NO: 41)
AI216611_T1 (SEQ ID NO: 42)

TABLE 50 Segments of interest

Segment Name
AI216611_N2 (SEQ ID NO: 126)
AI216611_N4 (SEQ ID NO: 127)
AI216611_N6 (SEQ ID NO: 128)

TABLE 51 Proteins of interest

Protein Name                   Corresponding Transcripts
AI216611_P0 (SEQ ID NO: 43)    AI216611_T0 (SEQ ID NO: 41)
AI216611_P1 (SEQ ID NO: 44)    AI216611_T1 (SEQ ID NO: 42)

AI216611 is an uncharacterized gene having no full length mRNA deposited in GenBank. The protein corresponding to AI216611_P0 appears in Celera's annotation of the human genome, based on computational analysis and translation of the genome (DNA sequence accession CH471065) (Venter, J. C. et al., 2001 Science 291, 1304-1351). The protein corresponding to AI216611_P0 is also listed among other sequences disclosed in WO2003025148. However, this application does not characterize its function or, more particularly, teach that it is a B7/CD28 costimulatory protein.

The protein corresponding to the AI216611_P1 sequence is a novel protein that is only partially similar (186 out of 199 amino acids are the same) to a polypeptide reported in WO205108415, assigned to Biogen-Idec, which purports that this polypeptide is a transmembrane protein that may be targeted in the treatment of hyperproliferative disorders. WO205108415 does not report a function of this polypeptide. More specifically, there is no indication that it is a B7/CD28 costimulatory protein.

According to the present invention, AI216611 is predicted to be a novel B7/CD28 family member based on the presence of an IgV domain, a characteristic structural feature of the B7/CD28 family members. In addition, AI216611 is similar to the known CD28 family members in its exons' sizes and the position of the IgV and transmembrane domains within these exons. Like all known B7/CD28 members, AI216611 is also a type I membrane protein. According to the present invention, two alternatively spliced transcripts of AI216611 are provided, each one containing a unique region within the intracellular region. The expression of AI216611 and its variants was demonstrated in the present invention to be downregulated in colon cancer, further supporting an immune costimulatory role.

As noted above, contig AI216611 features 2 transcripts, which were listed in Table 49 above. A description of each protein according to the present invention is now provided.

Protein AI216611_P0 (SEQ ID NO:43) according to the present invention has an amino acid sequence as encoded by transcript AI216611_T0 (SEQ ID NO:41).

The localization of the protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Protein AI216611_P0 (SEQ ID NO:43) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 52, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:43)).

TABLE 52 Amino acid mutations

SNP positions on amino acid sequence    Alternative amino acids
27                                      R -> L
27                                      R -> P
76                                      Q -> R
129                                     S -> R

The protein has the following domains, as determined by using InterPro. The domains are described in Table 53:

TABLE 53 InterPro domains

Domain description              Analysis type    Positions on protein
Immunoglobulin V-set domain     HMMPfam          33-129
Immunoglobulin subtype          HMMSmart         40-129

Protein AI216611_P0 (SEQ ID NO:43) is encoded by the transcript AI216611_T0 (SEQ ID NO:41), for which the coding portion starts at position 1 and ends at position 600. The transcript also has the following SNPs as listed in Table 54 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 54 Nucleic acid SNPs

Polymorphism    SNP positions on nucleotide sequence
G -> T          80, 991
G -> C          80, 991
C -> T          387, 990
C -> G          387, 1205
T -> C          906, 1063, 1090
A -> G          227, 1109
T -> A          1409, 1448
G -> A          1447
T -> G          1448

The genomic structure of protein AI216611_P0 (SEQ ID NO:43) (number of exons relevant to the extra-cellular region of the protein, the length of these exons, the frame of the codon in which the introns are inserted and the location of the protein features and domains in the gene structure) is characteristic of the receptors of the B7/co-stimulatory protein family, as given in Table 55.

TABLE 55 Genomic structure and protein features

Exon number    Exon length    Amino acids    Protein feature on exon
1              91             1-30           Signal Peptide
2              327            31-139         Ig-like domain
3              141            140-186        Trans-membrane region
4              41             187-200
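The exon lengths of Table 55 can be converted into coding coordinates and into the frame (phase) of the codon in which each intron is inserted. The Python sketch below is a non-limiting illustration; the function name exon_layout is hypothetical, and it relies on the coding portion starting at position 1, as stated above for AI216611_T0, so that the cumulative exon length equals the coding position.

```python
# Exon lengths in nucleotides, exons 1 to 4, as listed in Table 55.
EXON_LENGTHS = [91, 327, 141, 41]


def exon_layout(lengths):
    """Return (exon number, start, end, phase) tuples in 1-based coding
    coordinates. Phase is the number of nucleotides of a split codon that
    precede the exon boundary (0, 1 or 2); it is None for the last exon,
    which is not followed by an intron within the coding portion."""
    rows, start = [], 1
    for number, length in enumerate(lengths, start=1):
        end = start + length - 1
        # Valid only because the coding portion starts at position 1.
        phase = end % 3 if number < len(lengths) else None
        rows.append((number, start, end, phase))
        start = end + 1
    return rows


layout = exon_layout(EXON_LENGTHS)
for row in layout:
    print(row)
```

Under these assumptions the exons span coding positions 1-91, 92-418, 419-559 and 560-600, which agrees with the coding end position of 600 and with the amino acid ranges of Table 55. Each of the three exon boundaries falls after the first nucleotide of a codon (phase 1).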

Protein AI216611_P1 (SEQ ID NO:44) according to the present invention has an amino acid sequence as encoded by transcript AI216611_T1 (SEQ ID NO:42).

The localization of the protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Protein AI216611_P1 (SEQ ID NO:44) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 56, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:44)).

TABLE 56 Amino acid mutations

SNP positions on amino acid sequence    Alternative amino acids
27                                      R -> L
27                                      R -> P
76                                      Q -> R
129                                     S -> R

The protein has the following domains, as determined by using InterPro. The domains are described in Table 57:

TABLE 57 InterPro domains

Domain description              Analysis type    Positions on protein
Immunoglobulin V-set domain     HMMPfam          33-129
Immunoglobulin subtype          HMMSmart         40-129

Protein AI216611_P1 (SEQ ID NO:44) is encoded by the transcript AI216611_T1 (SEQ ID NO:42), for which the coding portion starts at position 1 and ends at position 597. The transcript also has the following SNPs as listed in Table 58 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 58 Nucleic acid SNPs

Polymorphism    SNP positions on nucleotide sequence
G -> T          80, 1121
G -> C          80, 1121
C -> T          387, 1120
C -> G          387, 1335
T -> C          1036, 1193, 1220
A -> G          227, 1239
T -> A          1549, 1578
G -> A          1577
T -> G          1578

As noted above, cluster AI216611 features 3 segments, which were listed in Table 50 above. These segments are portions of nucleic acid sequences which are described herein separately because they are of particular interest. A description of each segment according to the present invention is now provided.

Segment cluster AI216611_N2 (SEQ ID NO:126) according to the present invention is supported by 4 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: AI216611_T0 (SEQ ID NO:41) and AI216611_T1 (SEQ ID NO:42). Table 59 below describes the starting and ending position of this segment on each transcript.

TABLE 59 Segment location on transcripts

Transcript name                Segment starting position    Segment ending position
AI216611_T0 (SEQ ID NO: 41)    92                           418
AI216611_T1 (SEQ ID NO: 42)    92                           418

Segment cluster AI216611_N4 (SEQ ID NO:127) according to the present invention is supported by 4 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: AI216611_T0 (SEQ ID NO:41) and AI216611_T1 (SEQ ID NO:42). Table 60 below describes the starting and ending position of this segment on each transcript.

TABLE 60 Segment location on transcripts

Transcript name                Segment starting position    Segment ending position
AI216611_T0 (SEQ ID NO: 41)    419                          559
AI216611_T1 (SEQ ID NO: 42)    419                          559

Segment cluster AI216611_N6 (SEQ ID NO:128) according to the present invention is supported by 2 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: AI216611_T0 (SEQ ID NO:41) and AI216611_T1 (SEQ ID NO:42). Table 61 below describes the starting and ending position of this segment on each transcript.

TABLE 61 Segment location on transcripts

Transcript name                Segment starting position    Segment ending position
AI216611_T0 (SEQ ID NO: 41)    560                          885
AI216611_T1 (SEQ ID NO: 42)    690                          1015

Expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc4-6F2R2 (SEQ ID NO: 208) in normal and cancerous Colon tissues and in different normal tissues

Expression of AI216611 transcripts detectable by or according to junc4-6-AI216611_junc4-6F2R2 (SEQ ID NO: 208) amplicon and primers AI216611_junc4-6F2 (SEQ ID NO: 206) and AI216611_junc4-6R2 (SEQ ID NO: 207) was measured by real time PCR on colon panel and normal panel. The samples used are detailed in Table 5 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

Colon panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.
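The fold down-regulation calculation described above, in which the reciprocal of the ratio to the normal median is taken, can be sketched as follows in Python. The function name fold_down_regulation is hypothetical and the quantities are invented placeholder values, not measurements from the colon panel.

```python
from statistics import median


def fold_down_regulation(normalized, normal_ids):
    """Reciprocal of (quantity / median of normal samples) for each sample."""
    normal_median = median(normalized[i] for i in normal_ids)
    return {i: normal_median / q for i, q in normalized.items()}


# Hypothetical normalized quantities keyed by sample number (not panel data).
quantities = {1: 2.0, 2: 0.5, 42: 8.0, 43: 10.0, 44: 12.0}
folds = fold_down_regulation(quantities, normal_ids={42, 43, 44})

# Samples showing at least 5-fold down regulation.
down_5_fold = sorted(i for i, f in folds.items() if f >= 5)
print(folds, down_5_fold)
```

Taking the reciprocal expresses a lower quantity as a larger fold value, so that the same thresholding logic can be used for down regulation as for over-expression.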

FIG. 22 is a histogram showing down regulation of the above-indicated AI216611 transcripts in cancerous Colon samples relative to the normal samples.

As is evident from FIG. 22, the expression of AI216611 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-70, Table 5 above). Notably down regulation of at least 5 fold was found in 27 out of 55 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of AI216611 transcripts detectable by the above amplicon in Colon cancer samples versus the normal tissue samples was determined by T test as 2.39e-005.

Threshold of 5 fold down regulation was found to differentiate between cancer and normal samples with P value of 5.09e-007 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Normal panel—Non-detected samples (samples no. 50, 52, 54 and 56, Table 2) were assigned Ct value of 41 and were calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the colon samples (sample numbers 3, 4 and 5, Table 2 above), to obtain a value of relative expression of each sample relative to median of the colon samples, as shown in FIG. 23.
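The handling of non-detected samples can be illustrated with the Python sketch below, in which a missing Ct value is replaced by 41 before conversion to a relative quantity. The conversion formula is an assumption made for illustration only (quantity = efficiency raised to the power of minus Ct, with an efficiency of 2); the method actually used is the one described in Example 1. The names NON_DETECTED_CT and relative_quantity are hypothetical.

```python
NON_DETECTED_CT = 41.0  # Ct value assigned to non-detected samples in the text


def relative_quantity(ct, efficiency=2.0):
    """Convert a Ct value to a relative quantity, ASSUMING
    quantity = efficiency ** (-Ct). None marks a non-detected sample,
    which is assigned the Ct value of 41 before conversion."""
    if ct is None:
        ct = NON_DETECTED_CT
    return efficiency ** (-ct)


# Hypothetical Ct values keyed by sample number; None means not detected.
ct_values = {3: 28.0, 4: 29.0, 50: None}
quantities = {i: relative_quantity(ct) for i, ct in ct_values.items()}
print(quantities)
```

Assigning a fixed high Ct value gives non-detected samples a small but non-zero quantity, so that the subsequent normalization and division by the median remain defined.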

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI216611_junc4-6F2 (SEQ ID NO: 206) forward primer; and AI216611_junc4-6R2 (SEQ ID NO: 207) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI216611_junc4-6F2R2 (SEQ ID NO: 208).

Forward Primer >AI216611_junc4-6F2 (SEQ ID NO: 206):
CCGCAGTATTAATCAGCCTCATG
Reverse Primer >AI216611_junc4-6R2 (SEQ ID NO: 207):
AATCTCCTCAGTTGTGCTTTCTTTG
Amplicon >AI216611_junc4-6F2R2 (SEQ ID NO: 208)
CCGCAGTATTAATCAGCCTCATGTGGGTTTGTAAT
AAGTGTGCATATAAATTTCAGAGGAAGAGAAGACA
CAAACTCAAAGAAAGCACAACTGAGGAGATT

Expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_seg2WT (SEQ ID NO: 211) in normal and cancerous Colon tissues and in different normal tissues

Expression of AI216611 transcripts detectable by or according to seg2WT-AI216611_seg2WT (SEQ ID NO: 211) amplicon and primers AI216611_seg2WTF1 (SEQ ID NO: 209) and AI216611_seg2WTR1 (SEQ ID NO: 210) was measured by real time PCR on colon panel and normal panel. The samples used are detailed in Table 5 and Table 2 above, respectively.

Colon panel—Non-detected sample (sample no. 33, Table 5) was assigned Ct value of 41 and was calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.

FIG. 24 is a histogram showing down regulation of the above-indicated AI216611 transcripts in cancerous Colon samples relative to the normal samples.

As is evident from FIG. 24, the expression of AI216611 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-70, Table 5 above). Notably down regulation of at least 5 fold was found in 25 out of 55 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. Threshold of 5 fold down regulation was found to differentiate between cancer and normal samples with P value of 2.00e-006 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Normal panel: For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the colon samples (sample numbers 3, 4 and 5, Table 2 above), to obtain a value of relative expression of each sample relative to median of the colon samples, as shown in FIG. 25.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI216611_seg2WTF1 (SEQ ID NO: 209) forward primer; and AI216611_seg2WTR1 (SEQ ID NO: 210) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI216611_seg2WT (SEQ ID NO: 211).

Forward Primer >AI216611_seg2WTF1 (SEQ ID NO: 209)
GAACGCAGAAGATCGTGGAGT
Reverse Primer >AI216611_seg2WTR1 (SEQ ID NO: 210)
CTGAAGAGCTGGATGGAGCC
Amplicon >AI216611_seg2WT (SEQ ID NO: 211)
GAACGCAGAAGATCGTGGAGTGGAAACCAGGGACTCA
GGCCAACATCTCTCAAAGCCACAAGGACAGAGTCTGC
ACCTTTGACAACGGCTCCATCCAGCTCTTCAG

Expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611junc4-6 (SEQ ID NO: 214) in the blood-specific panel.

Expression of AI216611 transcripts detectable by or according to junc4-6-junc4-6F4R4 (SEQ ID NO: 214) amplicon and primers junc4-6F4 (SEQ ID NO: 212) and junc4-6R4 (SEQ ID NO: 213) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 26. Expression of the above-indicated AI216611 transcript was much higher in the normal samples checked than in the different blood-specific samples in the panel.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: junc4-6F4 (SEQ ID NO: 212) forward primer; and junc4-6R4 reverse primer (SEQ ID NO: 213).

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: junc4-6F4R4 (SEQ ID NO: 214).

Forward Primer AI216611junc4-6F4 (SEQ ID NO: 212):
CTGCACTTTGTCGCTGTCATC
Reverse Primer AI216611junc4-6R4 (SEQ ID NO: 213):
CAATCTCCTCAGTTGTGCTTTCTTTG
Amplicon AI216611junc4-6F4R4 (SEQ ID NO: 214):
CTGCACTTTGTCGCTGTCATCCTTGCTTTTCTCGCTG
CTGTGGCCGCAGTATTAATCAGCCTCATGTGGGTTTG
TAATAAGTGTGCATATAAATTTCAGAGGAAGAGAAGA
CACAAACTCAAAGAAAGCACAACTGAGGAGATTG

Expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611junc2-4seg5 (SEQ ID NO: 217) in the blood-specific panel.

Expression of AI216611 transcripts detectable by or according to junc2-4seg5-junc2-4seg5F3R4 amplicon (SEQ ID NO: 217) and primers junc2-4seg5F3 (SEQ ID NO: 215) and junc2-4seg5R4 (SEQ ID NO: 216) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 27. Expression of the above-indicated AI216611 transcript was much higher in the normal samples checked than in the different blood-specific samples in the panel.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: junc2-4seg5F3 (SEQ ID NO: 215) forward primer; and junc2-4seg5R4 (SEQ ID NO: 216) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: junc2-4seg5F3R4 (SEQ ID NO: 217).

Forward Primer AI216611 junc2-4seg5F3 (SEQ ID NO: 215):
TGCTGCACGTCTCTGAGATCC
Reverse Primer AI216611 junc2-4seg5R4 (SEQ ID NO: 216):
CACCTCTGGCCTCAAAACCACTC
Amplicon > AI216611 junc2-4seg5F3R4 (SEQ ID NO: 217)
TGCTGCACGTCTCTGAGATCCTCTATGAAGACCTGCAC
TTTGTCGCTGTCATCCTTGCTTTTCTCGCTGCTGTGGC
CGCAGTATTAATCAGCCTCATGTGGGTTTGTAATAAGT
GTGCATATAAATTTCAGAGGAAGAGAAGACACAAACTC
AAAGGTAACCCCCTGGGCCTTGTGATAATCCATGAGTG
GTTTTGAGGCCAGAGGTG

Expression of AI216611 transcripts which are detectable by amplicon as depicted in sequence name AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220) in normal and cancerous Colon tissues

Expression of AI216611 transcripts detectable by or according to junc2-4seg5F2R2—AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220) amplicon and primers AI216611_junc2-4seg5F2 (SEQ ID NO: 218) and AI216611_junc2-4seg5R2 (SEQ ID NO: 219) was measured by real time PCR on colon panel. The samples used are detailed in Table 5 above. Non-detected samples (samples no. 28, 33, 83, 85, 90 and 63, Table 5) were assigned Ct value of 41 and were calculated accordingly. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-62 and 64-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.

FIG. 28 is a histogram showing down regulation of the above-indicated AI216611 transcripts in cancerous Colon samples relative to the normal samples.

As is evident from FIG. 28, the expression of AI216611 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-62 and 64-70, Table 5 above). Notably, down regulation of at least 5 fold was found in 31 out of 55 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of AI216611 transcripts detectable by the above amplicon in Colon cancer samples versus the normal tissue samples was determined by T test as 5.29e-003.

A threshold of 5 fold down regulation was found to differentiate between cancer and normal samples with a P value of 4.18e-008, as checked by Fisher's exact test.

The above values demonstrate statistical significance of the results.
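A threshold test of this kind can be sketched with a 2x2 contingency table. The sketch below is illustrative only: it assumes a two-sided test (the text does not state the sidedness), it takes the 31 of 55 adenocarcinoma samples and the 28 normal samples (numbers 42-62 and 64-70) from the text, and it uses a hypothetical count of zero normal samples reaching the threshold, since that count is not given. It is therefore not expected to reproduce the reported P value.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1 = a + b
    row2 = c + d
    col1 = a + c
    total = row1 + row2

    def weight(x):
        # Unnormalized hypergeometric probability of x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x)

    low = max(0, col1 - row2)
    high = min(row1, col1)
    observed = weight(a)
    extreme = sum(weight(x) for x in range(low, high + 1) if weight(x) <= observed)
    return extreme / comb(total, col1)


# Rows: cancer, normal. Columns: at least 5 fold down, less than 5 fold down.
# The normal row (0, 28) is a hypothetical assumption.
p_value = fisher_exact_two_sided(31, 24, 0, 28)
```

Integer binomial coefficients are used so that the comparison against the observed table is exact rather than subject to floating point rounding.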

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: AI216611_junc2-4seg5F2 (SEQ ID NO: 218) forward primer; and AI216611_junc2-4seg5R2 (SEQ ID NO: 219) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220).

Forward Primer >AI216611_junc2-4seg5F2 (SEQ ID NO: 218)
GCTGCACGTCTCTGAGATCCT
Reverse Primer >AI216611_junc2-4seg5R2 (SEQ ID NO: 219)
CACCTCTGGCCTCAAAACCA
Amplicon >AI216611_junc2-4seg5F2R2 (SEQ ID NO: 220)
GCTGCACGTCTCTGAGATCCTCTATGAAGACCTGCACTT
TGTCGCTGTCATCCTTGCTTTTCTCGCTGCTGTGGCCGC
AGTATTAATCAGCCTCATGTGGGTTTGTAATAAGTGTGC
ATATAAATTTCAGAGGAAGAGAAGACACAAACTCAAAGG
TAACCCCCTGGGCCTTGTGATAATCCATGAGTGGTTTTG
AGGCCAGAGGTG
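As an illustrative consistency check, the relationship between a primer pair and its amplicon can be sketched as follows: the amplicon is expected to begin with the forward primer and to end with the reverse complement of the reverse primer. The sequences are those listed for SEQ ID NOs: 218-220; the function names are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq))


def primers_flank_amplicon(forward, reverse, amplicon):
    """True if the amplicon starts with the forward primer and ends with
    the reverse complement of the reverse primer."""
    return amplicon.startswith(forward) and amplicon.endswith(reverse_complement(reverse))


FORWARD = "GCTGCACGTCTCTGAGATCCT"   # SEQ ID NO: 218
REVERSE = "CACCTCTGGCCTCAAAACCA"    # SEQ ID NO: 219
AMPLICON = (                        # SEQ ID NO: 220
    "GCTGCACGTCTCTGAGATCCTCTATGAAGACCTGCACTT"
    "TGTCGCTGTCATCCTTGCTTTTCTCGCTGCTGTGGCCGC"
    "AGTATTAATCAGCCTCATGTGGGTTTGTAATAAGTGTGC"
    "ATATAAATTTCAGAGGAAGAGAAGACACAAACTCAAAGG"
    "TAACCCCCTGGGCCTTGTGATAATCCATGAGTGGTTTTG"
    "AGGCCAGAGGTG"
)

consistent = primers_flank_amplicon(FORWARD, REVERSE, AMPLICON)
```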

Example 5 Description for Cluster H68654_1

The present invention relates to LOC253012 polypeptides, novel splice variants and diagnostics and therapeutics based thereon.

According to the present invention, Cluster H68654_1 (internal ID 76432882) features 10 transcripts and 3 segments of interest, the names for which are given in Tables 62 and 63, respectively. The selected protein variants are given in Table 64.

TABLE 62 Transcripts of interest
Transcript Name
H68654_1_T0 (SEQ ID NO: 25)
H68654_1_T4 (SEQ ID NO: 26)
H68654_1_T5 (SEQ ID NO: 27)
H68654_1_T8 (SEQ ID NO: 28)
H68654_1_T15 (SEQ ID NO: 29)
H68654_1_T16 (SEQ ID NO: 30)
H68654_1_T17 (SEQ ID NO: 31)
H68654_1_T18 (SEQ ID NO: 32)
H68654_1_T19 (SEQ ID NO: 33)
H68654_1_T20 (SEQ ID NO: 34)

TABLE 63 Segments of interest
Segment Name
H68654_1_N3 (SEQ ID NO: 123)
H68654_1_N7 (SEQ ID NO: 124)
H68654_1_N12 (SEQ ID NO: 125)

TABLE 64 Proteins of interest
Protein Name                    Corresponding Transcripts
H68654_1_P2 (SEQ ID NO: 35)     H68654_1_T0 (SEQ ID NO: 25); H68654_1_T5 (SEQ ID NO: 27)
H68654_1_P5 (SEQ ID NO: 36)     H68654_1_T4 (SEQ ID NO: 26)
H68654_1_P7 (SEQ ID NO: 37)     H68654_1_T8 (SEQ ID NO: 28)
H68654_1_P12 (SEQ ID NO: 38)    H68654_1_T15 (SEQ ID NO: 29); H68654_1_T16 (SEQ ID NO: 30); H68654_1_T18 (SEQ ID NO: 32)
H68654_1_P13 (SEQ ID NO: 39)    H68654_1_T17 (SEQ ID NO: 31); H68654_1_T19 (SEQ ID NO: 33)
H68654_1_P14 (SEQ ID NO: 40)    H68654_1_T20 (SEQ ID NO: 34)

These sequences are variants of the known protein hypothetical protein LOC253012 isoform 1 (RefSeq accession identifier NP_001034461 (SEQ ID NO: 35), NP_937794 (SEQ ID NO: 36)), referred to herein as the previously known protein.

The known LOC253012 is a hypothetical protein that was computationally discovered during the secreted protein discovery initiative project (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins (Clark et al 2003, Genome Res. 13: 2265-2270). Its closest paralog that is experimentally validated is the hepatocyte cell adhesion molecule (Refseq accession NP_689935) (evalue e-21).

The LOC253012 antigen has been reported in European patent application Number EP162070, assigned to Genentech Inc., which purports that LOC253012 (PRO346), corresponding to H68654_1_P2 (SEQ ID NO:35), is differentially expressed in lung tumor samples. This patent specification alleges that the corresponding polypeptide can be used for developing antibodies against the disclosed PRO antigens, including PRO346. The patent application further suggests that antibodies to a genus of polypeptides reported therein that includes PRO346 may be used for treating and diagnosing cancer and specifically for diagnosing and treating lung cancer. The Genentech patent application does not teach, however, that LOC253012 (PRO346) is differentially expressed in small cell lung carcinoma. Also, there is no specific teaching in the Genentech Inc. patent application that the anti-PRO346 antibodies can be used for treating small cell lung carcinomas and/or for modulating co-stimulation of the APC/T cell activity. Also, there is no teaching in the Genentech Inc. patent application that PRO346 is an immune costimulatory protein or more specifically a B7 family member. There is no teaching in the Genentech Inc. patent application of the use of antibodies against the PRO346 antigen for modulating immune co-stimulation or particularly the B7 co-stimulatory pathway.

According to the present invention, LOC253012 was predicted to be a novel immune costimulatory protein and in particular a B7 co-stimulatory protein. The prediction was based on the presence of both an IgV domain and an IgC2 domain, a characteristic structural feature of the B7-family members. Like other B7 members, LOC253012 is also a type I membrane protein. LOC253012 and its variants were demonstrated in the present invention to be overexpressed in lung cancer.

The MED discovery engine described in Example 1 herein was used to assess the expression of LOC253012 transcripts. Expression data for Affymetrix probe set 242601_at, representing the LOC253012 gene, is shown in FIG. 29. As evident from the scatter plot presented in FIG. 29, the expression of LOC253012 transcripts detectable with the above probe set was higher in lung cancer compared to normal lung samples.

As noted above, cluster H68654 features 10 transcripts, which were listed in Table 62 above. These transcripts encode for proteins which are variants of protein hypothetical protein LOC253012 isoform 1 (SEQ ID NO:35). A description of each variant protein according to the present invention is now provided.

Variant protein H68654_1_P2 (SEQ ID NO:35) according to the present invention has an amino acid sequence as encoded by transcripts H68654_1_T0 (SEQ ID NO:25) and H68654_1_T5 (SEQ ID NO:27).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P2 (SEQ ID NO:35) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 65, (given according to their positions on the amino acid sequence, with the alternative amino acids listed).

TABLE 65 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
33                                      K -> R
86                                      K -> N
204                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 66:

TABLE 66 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin subtype    HMMSmart         38-141, 155-235, 255-333
Immunoglobulin-like       ProfileScan      149-233, 235-331
Immunoglobulin-like       HMMPfam          163-221
Immunoglobulin V-set      HMMPfam          31-143
Immunoglobulin C2 type    HMMSmart         164-226

The coding portion of transcript H68654_1_T0 (SEQ ID NO:25) starts at position 79 and ends at position 1464. The transcript also has the following SNPs as listed in Table 67 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 67 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          176, 1545
G -> A          336, 1754
G -> C          336
T -> C          564, 689
A -> T          1545
G -> T          1684
T -> A          1755
T -> G          1755
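The relationship between the nucleotide SNP positions of Table 67 and the amino acid positions of Table 65 can be illustrated with a short sketch. It assumes 1-based transcript coordinates and uses the stated coding portion of transcript H68654_1_T0 (positions 79 to 1464); the function name is hypothetical.

```python
def snp_to_codon(nucleotide_position, cds_start, cds_end):
    """Map a 1-based transcript position to (amino acid position, position in codon).

    Returns None when the position lies outside the coding portion.
    """
    if not cds_start <= nucleotide_position <= cds_end:
        return None
    offset = nucleotide_position - cds_start
    return offset // 3 + 1, offset % 3 + 1


CDS_START, CDS_END = 79, 1464  # coding portion of H68654_1_T0 as stated in the text

mapped = {position: snp_to_codon(position, CDS_START, CDS_END)
          for position in (176, 336, 564, 689, 1545, 1684, 1754, 1755)}
```

Under these assumptions, positions 176, 336 and 689 fall in codons 33, 86 and 204, matching the amino acid positions listed in Table 65. Position 564 falls in the third base of codon 162, which is not listed in Table 65, and positions 1545 and above lie beyond the end of the coding portion.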

The coding portion of transcript H68654_1_T5 (SEQ ID NO:27) starts at position 79 and ends at position 1464. The transcript also has the following SNPs as listed in Table 68 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 68 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          176, 1545
G -> A          336, 1754
G -> C          336
T -> C          564, 689
A -> T          1545
G -> T          1684
T -> A          1755
T -> G          1755

The genomic structure of protein H68654_1_P2 (SEQ ID NO: 35) (number of exons relevant to the extra-cellular region of the protein, the length of these exons, the frame of the codon in which the introns are inserted and the location of the protein features and domains in the gene structure) is characteristic of the ligands of the B7/co-stimulatory protein family, as given in Table 69.

TABLE 69 genomic structure and protein features
Exon number    Exon Length    Amino-Acids    Protein feature on exon
1              79             1-26           Signal Peptide
2              351            27-143         IgV domain
3              285            144-238        IgC2 domain
4              297            239-337        Ig-like domain
5              126            338-379        Trans-membrane region
6              25             380-387
7              38             388-400
8              74             401-425
9              110            426-462
10             1              462-462
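The codon frame in which each intron is inserted can be derived from the exon lengths of Table 69, as sketched below. This assumes that the listed exon lengths count coding nucleotides only, starting at the first base of the coding sequence; that assumption is consistent with their sum of 1386, which equals the length of the coding portion stated for transcript H68654_1_T0 (positions 79 to 1464). The function name is hypothetical.

```python
from itertools import accumulate

# Exon lengths as listed in Table 69 (assumed to be coding nucleotides only).
EXON_LENGTHS = [79, 351, 285, 297, 126, 25, 38, 74, 110, 1]


def intron_phases(exon_lengths):
    """Phase of the intron following each exon except the last.

    The phase is the number of bases of an incomplete codon that precede
    the intron, i.e. the cumulative coding length modulo 3.
    """
    cumulative = list(accumulate(exon_lengths))
    return [total % 3 for total in cumulative[:-1]]


phases = intron_phases(EXON_LENGTHS)
total_coding_length = sum(EXON_LENGTHS)
```

Under this assumption, the introns following exons 1 to 5, which cover the signal peptide, the IgV, IgC2 and Ig-like domains and the trans-membrane region, are all in phase 1.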

Variant protein H68654_1_P5 (SEQ ID NO:36) according to the present invention has an amino acid sequence encoded by transcript H68654_1_T4 (SEQ ID NO:26).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P5 (SEQ ID NO:36) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 70, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:36)).

TABLE 70 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
21                                      K -> R
74                                      K -> N
192                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 71:

TABLE 71 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin C2 type    HMMSmart         152-214
Immunoglobulin subtype    HMMSmart         26-129, 143-223, 243-321
Immunoglobulin-like       HMMPfam          151-209
Immunoglobulin V-set      HMMPfam          19-131
Immunoglobulin-like       ProfileScan      137-221, 223-319

Variant protein H68654_1_P5 (SEQ ID NO:36) is encoded by the transcript H68654_1_T4 (SEQ ID NO:26), for which the coding portion starts at position 102 and ends at position 1451. The transcript also has the following SNPs as listed in Table 72 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 72 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163, 1532
G -> A          323, 1741
G -> C          323
T -> C          551, 676
A -> T          1532
G -> T          1671
T -> A          1742
T -> G          1742

Variant protein H68654_1_P7 (SEQ ID NO:37) according to the present invention has an amino acid sequence encoded by transcript H68654_1_T8 (SEQ ID NO:28). Alignments of H68654_1_P7 (SEQ ID NO:37) to one or more previously published protein sequences are shown in FIGS. 30A and 30B. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between H68654_1_P7 (SEQ ID NO:37) and known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36) (FIG. 30A):

A. An isolated chimeric polypeptide encoding for H68654_1_P7 (SEQ ID NO:37), comprising a first amino acid sequence being at least 90% homologous to

MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPAS DIQIIWLFERPHTMPKYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPL QFPDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEY VGNMTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKE DIGNYSCLVRNPVSEMESDIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDL GEAILFDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMD YVCCAYNNITGRQDETHFTVIITSVGLEKLAQKGKSLSPLASITGISLFL IISMCLLFLWKKYQPYK corresponding to amino acids 1-367 of known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36), which also corresponds to amino acids 1-367 of H68654_1_P7 (SEQ ID NO:37), and a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence

(SEQ ID NO: 290) GQKQNTGKLKHFQAMKMLWMTSEYMNLLLFQMFLVFPGSQAGLFQPLIVYR GKICTVQCMKLFSTSLPSSKTIQSELSWAKQYIRVKF corresponding to amino acids 368-455 of H68654_1_P7 (SEQ ID NO:37), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of H68654_1_P7 (SEQ ID NO:37), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence

(SEQ ID NO: 290) GQKQNTGKLKHFQAMKMLWMTSEYMNLLLFQMFLVFPGSQAGLFQPLIVYR GKICTVQCMKLFSTSLPSSKTIQSELSWAKQYIRVKF of H68654_1_P7 (SEQ ID NO:37).

2. Comparison report between H68654_1_P7 (SEQ ID NO:37) and known proteins NP_001034461 (SEQ ID NO: 35) (FIG. 30B):

A. An isolated chimeric polypeptide encoding for H68654_1_P7 (SEQ ID NO:37), comprising a first amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99%, homologous to a polypeptide having the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) corresponding to amino acids 1-14 of H68654_1_P7 (SEQ ID NO:37), a second amino acid sequence being at least 90% homologous to

GACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMP KYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQG NGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLA YQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMES DIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYS WIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTV IITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYK corresponding to amino acids 27-379 of known proteins NP_001034461 (SEQ ID NO: 35), which also corresponds to amino acids 15-367 of H68654_1_P7 (SEQ ID NO:37), and a third amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence

(SEQ ID NO: 290) GQKQNTGKLKHFQAMKMLWMTSEYMNLLLFQMFLVFPGSQAGLFQPLIVYR GKICTVQCMKLFSTSLPSSKTIQSELSWAKQYIRVKF corresponding to amino acids 368-455 of H68654_1_P7 (SEQ ID NO:37), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for a head of H68654_1_P7 (SEQ ID NO:37), comprising a polypeptide being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) of H68654_1_P7 (SEQ ID NO:37).

C. An isolated polypeptide encoding for an edge portion of H68654_1_P7 (SEQ ID NO:37), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence

(SEQ ID NO: 290) GQKQNTGKLKHFQAMKMLWMTSEYMNLLLFQMFLVFPGSQAGLFQPLIVYR GKICTVQCMKLFSTSLPSSKTIQSELSWAKQYIRVKF of H68654_1_P7 (SEQ ID NO:37).
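The segment coordinates given in the comparison reports can be checked for internal consistency with a short sketch. It uses the amino acid ranges stated above for H68654_1_P7 (SEQ ID NO:37) relative to NP_001034461 (SEQ ID NO: 35); the function names are hypothetical and the check is illustrative only.

```python
def is_contiguous(segments, total_length):
    """True if 1-based inclusive (start, end) segments tile 1..total_length in order."""
    expected_start = 1
    for start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


def known_to_variant(position, known_start, variant_start):
    """Map a position on the known protein to the variant for an aligned segment."""
    return position - known_start + variant_start


# H68654_1_P7: head 1-14, shared region 15-367, unique tail 368-455.
P7_SEGMENTS = [(1, 14), (15, 367), (368, 455)]
p7_ok = is_contiguous(P7_SEGMENTS, 455)

# Shared region: amino acids 27-379 of NP_001034461 correspond to 15-367 of the variant.
shared_end_on_variant = known_to_variant(379, known_start=27, variant_start=15)
```

The same offset of 12 residues relates the amino acid SNP positions of Table 65 (33, 86 and 204) to those listed for the variants aligned from position 15 (21, 74 and 192).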

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P7 (SEQ ID NO:37) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 73, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:37)).

TABLE 73 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
21                                      K -> R
74                                      K -> N
192                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 74:

TABLE 74 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin subtype    HMMSmart         26-129, 143-223, 243-321
Immunoglobulin C2 type    HMMSmart         152-214
Immunoglobulin-like       ProfileScan      137-221, 223-319
Immunoglobulin V-set      HMMPfam          19-131
Immunoglobulin-like       HMMPfam          151-209

Variant protein H68654_1_P7 (SEQ ID NO:37) is encoded by the transcript H68654_1_T8 (SEQ ID NO:28), for which the coding portion starts at position 102 and ends at position 1466. The transcript also has the following SNPs as listed in Table 75 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 75 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163, 1507
G -> A          323, 1716
G -> C          323
T -> C          551, 676
A -> T          1507
G -> T          1646
T -> A          1717
T -> G          1717
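The stated coding portions can be related to the variant protein lengths with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the coding portion as stated contains one codon per amino acid; under that assumption the coding portion of H68654_1_T8 (102 to 1466) corresponds to 455 codons, matching the last amino acid position (455) given for H68654_1_P7 in the comparison reports. The function name is hypothetical.

```python
def codon_count(cds_start, cds_end):
    """Number of codons in a coding portion given 1-based inclusive coordinates."""
    length = cds_end - cds_start + 1
    if length % 3 != 0:
        raise ValueError("coding portion length is not a multiple of 3")
    return length // 3


# Coding portions as stated in the text: transcript -> (start, end).
CODING_PORTIONS = {
    "H68654_1_T0": (79, 1464),
    "H68654_1_T4": (102, 1451),
    "H68654_1_T8": (102, 1466),
    "H68654_1_T15": (102, 1358),
    "H68654_1_T17": (102, 1229),
}

codons = {name: codon_count(start, end) for name, (start, end) in CODING_PORTIONS.items()}
```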

Variant protein H68654_1_P12 (SEQ ID NO:38) according to the present invention has an amino acid sequence as encoded by transcripts H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30) and H68654_1_T18 (SEQ ID NO:32). Alignments of H68654_1_P12 (SEQ ID NO:38) to previously published protein sequences are shown in FIGS. 30C and 30D. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between H68654_1_P12 (SEQ ID NO:38) and known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36) (FIG. 30C):

A. An isolated chimeric polypeptide encoding for H68654_1_P12 (SEQ ID NO:38), comprising a first amino acid sequence being at least 90% homologous to

MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASD IQIIWLFERPHTMPKYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQF PDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGN MTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGN YSCLVRNPVSEMESDIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAIL FDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAY NNITGRQDETHFTVIITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLL FLWKKYQPYKVIKQKLEGRPETEYRKAQTFSGHEDALDDFGIYEFVAFPDV SGVSR corresponding to amino acids 1-413 of known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36), which also corresponds to amino acids 1-413 of H68654_1_P12 (SEQ ID NO:38), and a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence VGFPSG (SEQ ID NO: 291) corresponding to amino acids 414-419 of H68654_1_P12 (SEQ ID NO:38), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of H68654_1_P12 (SEQ ID NO:38), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence VGFPSG (SEQ ID NO: 291) of H68654_1_P12 (SEQ ID NO:38).

2. Comparison report between H68654_1_P12 (SEQ ID NO:38) and known proteins NP_001034461 (SEQ ID NO: 35) (FIG. 30D):

A. An isolated chimeric polypeptide encoding for H68654_1_P12 (SEQ ID NO:38), comprising a first amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99%, homologous to a polypeptide having the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) corresponding to amino acids 1-14 of H68654_1_P12 (SEQ ID NO:38), a second amino acid sequence being at least 90% homologous to

GACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMP KYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQG NGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLA YQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMES DIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYS WIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTV IITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQ KLEGRPETEYRKAQTFSGHEDALDDFGIYEFVAFPDVSGVSR corresponding to amino acids 27-425 of known proteins NP_001034461 (SEQ ID NO: 35), which also corresponds to amino acids 15-413 of H68654_1_P12 (SEQ ID NO:38), and a third amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence VGFPSG (SEQ ID NO: 291) corresponding to amino acids 414-419 of H68654_1_P12 (SEQ ID NO:38), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for a head of H68654_1_P12 (SEQ ID NO:38), comprising a polypeptide being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) of H68654_1_P12 (SEQ ID NO:38).

C. An isolated polypeptide encoding for an edge portion of H68654_1_P12 (SEQ ID NO:38), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence VGFPSG (SEQ ID NO: 291) of H68654_1_P12 (SEQ ID NO:38).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P12 (SEQ ID NO:38) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 76, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:38)).

TABLE 76 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
21                                      K -> R
74                                      K -> N
192                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 77.

TABLE 77 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin subtype    HMMSmart         26-129, 143-223, 243-321
Immunoglobulin-like       HMMPfam          151-209
Immunoglobulin C2 type    HMMSmart         152-214
Immunoglobulin V-set      HMMPfam          19-131
Immunoglobulin-like       ProfileScan      137-221, 223-319

The coding portion of transcript H68654_1_T15 (SEQ ID NO:29) starts at position 102 and ends at position 1358. The transcript also has the following SNPs as listed in Table 78 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 78 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163, 1586
G -> A          323, 1795
G -> C          323
T -> C          551, 676
A -> T          1586
G -> T          1725
T -> A          1796
T -> G          1796

The coding portion of transcript H68654_1_ starts at position 102 and ends at position 1358. The transcript also has the following SNPs as listed in Table 79 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 79 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163, 1588
G -> A          323, 1797
G -> C          323
T -> C          551, 676
A -> T          1588
G -> T          1727
T -> A          1798
T -> G          1798

The coding portion of transcript H68654_1_T18 (SEQ ID NO:32) starts at position 102 and ends at position 1358. The transcript also has the following SNPs as listed in Table 80 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 80 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163
G -> A          323, 2117
G -> C          323
T -> C          551, 676
T -> A          1603
T -> G          1603

Variant protein H68654_1_P13 (SEQ ID NO:39) according to the present invention has an amino acid sequence as encoded by transcripts H68654_1_T17 (SEQ ID NO:31) and H68654_1_T19 (SEQ ID NO:33). Alignments to one or more previously published protein sequences are shown in FIGS. 30E and 30F. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between H68654_1_P13 (SEQ ID NO:39) and known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36) (FIG. 30E):

A. An isolated chimeric polypeptide encoding for H68654_1_P13 (SEQ ID NO:39), comprising an amino acid sequence being at least 90% homologous to

MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASD IQIIWLFERPHTMPKYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQF PDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGN MTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGN YSCLVRNPVSEMESDIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAIL FDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAY NNITGRQDETHFTVIITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLL FLWKKYQPYKVIKQKLEGR corresponding to amino acids 1-376 of known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36), which also corresponds to amino acids 1-376 of H68654_1_P13 (SEQ ID NO:39), wherein said and first amino acid sequence are contiguous and in a sequential order.

2. Comparison report between H68654_1_P13 (SEQ ID NO:39) and known proteins NP_001034461 (SEQ ID NO: 35) (FIG. 30F):

A. An isolated chimeric polypeptide encoding for H68654_1_P13 (SEQ ID NO:39), comprising a first amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99%, homologous to a polypeptide having the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) corresponding to amino acids 1-14 of H68654_1_P13 (SEQ ID NO:39), and a second amino acid sequence being at least 90% homologous to

GACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMP KYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQG NGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLA YQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMES DIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYS WIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTV IITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQ KLEGR corresponding to amino acids 27-388 of known proteins NP_001034461 (SEQ ID NO: 35), which also corresponds to amino acids 15-376 of H68654_1_P13 (SEQ ID NO:39), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for a head of H68654_1_P13 (SEQ ID NO:39), comprising a polypeptide being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) of H68654_1_P13 (SEQ ID NO:39).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P13 (SEQ ID NO:39) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 81, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:39)).

TABLE 81 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
21                                      K -> R
74                                      K -> N
192                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 82:

TABLE 82 InterPro domains
Domain description        Analysis type    Positions on protein
Immunoglobulin subtype    HMMSmart         26-129, 143-223, 243-321
Immunoglobulin-like       ProfileScan      137-221, 223-319
Immunoglobulin C2 type    HMMSmart         152-214
Immunoglobulin V-set      HMMPfam          19-131
Immunoglobulin-like       HMMPfam          151-209

The coding portion of transcript H68654_1_T17 (SEQ ID NO:31) starts at position 102 and ends at position 1229. The transcript also has the following SNPs as listed in Table 83 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 83 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163, 1626
G -> A          323, 1835
G -> C          323
T -> C          551, 676
A -> T          1626
G -> T          1765
T -> A          1836
T -> G          1836

The coding portion of transcript H68654_1_T19 (SEQ ID NO:33) starts at position 102 and ends at position 1229. The transcript also has the following SNPs as listed in Table 84 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 84 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163
G -> A          323, 2211
G -> C          323
T -> C          551, 676
T -> A          1697
T -> G          1697

Variant protein H68654_1_P14 (SEQ ID NO:40) according to the present invention has an amino acid sequence as encoded by transcript H68654_1_T20 (SEQ ID NO:34). Alignments to previously published protein sequences are shown in FIGS. 30G and 30H. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between H68654_1_P14 (SEQ ID NO:40) and known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36) (FIG. 30G):

A. An isolated chimeric polypeptide encoding for H68654_1_P14 (SEQ ID NO:40), comprising a first amino acid sequence being at least 90% homologous to MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQII WLFERPHTMPKYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYI VKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEG GTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEM ESDIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYSWIR RTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTVIITSVGL EKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQKLEGRPETEYRKA QTFSG corresponding to amino acids 1-389 of known proteins NP_937794 and Q6UXI0_HUMAN (SEQ ID NO: 36), which also corresponds to amino acids 1-389 of H68654_1_P14 (SEQ ID NO:40), and a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence FMLAAPSQREEEKKIWQGPGLLLCPHCNPHYHQY (SEQ ID NO: 292) corresponding to amino acids 390-423 of H68654_1_P14 (SEQ ID NO:40), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of H68654_1_P14 (SEQ ID NO:40), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence FMLAAPSQREEEKKIWQGPGLLLCPHCNPHYHQY (SEQ ID NO: 292) of H68654_1_P14 (SEQ ID NO:40).

2. Comparison report between H68654_1_P14 (SEQ ID NO:40) and known proteins NP_001034461 (SEQ ID NO: 35) (FIG. 30H):

A. An isolated chimeric polypeptide encoding for H68654_1_P14 (SEQ ID NO:40), comprising a first amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99%, homologous to a polypeptide having the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) corresponding to amino acids 1-14 of H68654_1_P14 (SEQ ID NO:40), a second amino acid sequence being at least 90% homologous to

GACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMP KYLLGSVNKSVVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQG NGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEYVGNMTLTCHVEGGTRLA YQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMES DIIMPIIYYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYS WIRRTDNTTYIIKHGPRLEVASEKVAQKTMDYVCCAYNNITGRQDETHFTV IITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQ KLEGRPETEYRKAQTFSG corresponding to amino acids 27-401 of known proteins NP_001034461 (SEQ ID NO: 35), which also corresponds to amino acids 15-389 of H68654_1_P14 (SEQ ID NO:40), and a third amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence FMLAAPSQREEEKKIWQGPGLLLCPHCNPHYHQY (SEQ ID NO: 292) corresponding to amino acids 390-423 of H68654_1_P14 (SEQ ID NO:40), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for a head of H68654_1_P14 (SEQ ID NO:40), comprising a polypeptide being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence MWLKVFTTFLSFAT (SEQ ID NO: 289) of H68654_1_P14 (SEQ ID NO:40).

C. An isolated polypeptide encoding for an edge portion of H68654_1_P14 (SEQ ID NO:40), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence FMLAAPSQREEEKKIWQGPGLLLCPHCNPHYHQY (SEQ ID NO: 292) of H68654_1_P14 (SEQ ID NO:40).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H68654_1_P14 (SEQ ID NO:40) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 85, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:40)).

TABLE 85 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
21                                      K -> R
74                                      K -> N
192                                     L -> P

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 86:

TABLE 86 InterPro domains
Domain description          Analysis type    Positions on protein
Immunoglobulin C2 type      HMMSmart         152-214
Immunoglobulin subtype      HMMSmart         26-129, 143-223, 243-321
Immunoglobulin-like         ProfileScan      137-221, 223-319
Immunoglobulin V-set        HMMPfam          19-131
Immunoglobulin-like         HMMPfam          151-209

Variant protein H68654_1_P14 (SEQ ID NO:40) is encoded by the transcript H68654_1_T20 (SEQ ID NO:34), for which the coding portion starts at position 102 and ends at position 1370. The transcript also has the following SNPs as listed in Table 87 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 87 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
A -> G          163
G -> A          323
G -> C          323
T -> C          551, 676
C -> A          1491
C -> G          1491
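By way of non-limiting illustration only, the relation between the nucleotide positions of Table 87 and the amino acid positions of Table 85 can be sketched from the stated coding start. The sketch assumes 1-based transcript coordinates and that position 102 is the first base of codon 1; the helper name `locate_snp` is hypothetical and is not part of the original disclosure.

```python
# Hypothetical helper: map a 1-based transcript position to a codon.
CDS_START = 102   # first coding position of H68654_1_T20 (from the text)
CDS_END = 1370    # last coding position of H68654_1_T20 (from the text)

def locate_snp(position, cds_start=CDS_START, cds_end=CDS_END):
    """Return (amino_acid_number, base_within_codon), both 1-based,
    or None when the position lies outside the coding portion."""
    if position < cds_start or position > cds_end:
        return None
    offset = position - cds_start          # 0-based offset into the coding portion
    return offset // 3 + 1, offset % 3 + 1

# SNP positions listed in Table 87
for pos in (163, 323, 551, 676, 1491):
    print(pos, locate_snp(pos))
```

Under these assumptions, positions 163, 323 and 676 fall in codons 21, 74 and 192, which are the positions listed in Table 85; position 551 falls in the third base of codon 150, and position 1491 lies outside the coding portion.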

As noted above, cluster H68654 features 3 segments, which were listed in Table 63. These segments are portions of nucleic acid sequences which are described herein separately because they are of particular interest. A description of each segment according to the present invention is now provided.

Segment cluster H68654_1_N3 (SEQ ID NO:123) according to the present invention is supported by 24 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H68654_1_T0 (SEQ ID NO:25), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), H68654_1_T20 (SEQ ID NO:34), H68654_1_T4 (SEQ ID NO:26), H68654_1_T5 (SEQ ID NO:27) and H68654_1_T8 (SEQ ID NO:28). Table 88 below describes the starting and ending position of this segment on each transcript.

TABLE 88 Segment location on transcripts
Transcript name                 Segment starting position    Segment ending position
H68654_1_T0 (SEQ ID NO: 25)     158                          508
H68654_1_T15 (SEQ ID NO: 29)    145                          495
H68654_1_T16 (SEQ ID NO: 30)    145                          495
H68654_1_T17 (SEQ ID NO: 31)    145                          495
H68654_1_T18 (SEQ ID NO: 32)    145                          495
H68654_1_T19 (SEQ ID NO: 33)    145                          495
H68654_1_T20 (SEQ ID NO: 34)    145                          495
H68654_1_T4 (SEQ ID NO: 26)     145                          495
H68654_1_T5 (SEQ ID NO: 27)     158                          508
H68654_1_T8 (SEQ ID NO: 28)     145                          495
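As a non-limiting illustration, the coordinates of Table 88 can be checked for internal consistency: the same segment should have the same length on every transcript in which it appears. The sketch below assumes the start and end positions are 1-based and inclusive; the function name `segment_lengths` is hypothetical.

```python
# Start/end positions of segment H68654_1_N3 on each transcript (Table 88).
TABLE_88 = {
    "H68654_1_T0": (158, 508),
    "H68654_1_T15": (145, 495),
    "H68654_1_T16": (145, 495),
    "H68654_1_T17": (145, 495),
    "H68654_1_T18": (145, 495),
    "H68654_1_T19": (145, 495),
    "H68654_1_T20": (145, 495),
    "H68654_1_T4": (145, 495),
    "H68654_1_T5": (158, 508),
    "H68654_1_T8": (145, 495),
}

def segment_lengths(table):
    """Length of the segment on each transcript, assuming inclusive coordinates."""
    return {name: end - start + 1 for name, (start, end) in table.items()}

lengths = segment_lengths(TABLE_88)
print(set(lengths.values()))  # a single value means the table is self-consistent
```

With the values of Table 88 the segment is 351 nucleotides long on all ten transcripts; the T0 and T5 coordinates are simply shifted by 13 positions relative to the others.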

Segment cluster H68654_1_N7 (SEQ ID NO:124) according to the present invention is supported by 20 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H68654_1_T0 (SEQ ID NO:25), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), H68654_1_T20 (SEQ ID NO:34), H68654_1_T4 (SEQ ID NO:26), H68654_1_T5 (SEQ ID NO:27) and H68654_1_T8 (SEQ ID NO:28). Table 89 below describes the starting and ending position of this segment on each transcript.

TABLE 89 Segment location on transcripts
Transcript name                 Segment starting position    Segment ending position
H68654_1_T0 (SEQ ID NO: 25)     794                          1090
H68654_1_T15 (SEQ ID NO: 29)    781                          1077
H68654_1_T16 (SEQ ID NO: 30)    781                          1077
H68654_1_T17 (SEQ ID NO: 31)    781                          1077
H68654_1_T18 (SEQ ID NO: 32)    781                          1077
H68654_1_T19 (SEQ ID NO: 33)    781                          1077
H68654_1_T20 (SEQ ID NO: 34)    781                          1077
H68654_1_T4 (SEQ ID NO: 26)     781                          1077
H68654_1_T5 (SEQ ID NO: 27)     794                          1090
H68654_1_T8 (SEQ ID NO: 28)     781                          1077

Segment cluster H68654_1_N12 (SEQ ID NO:125) according to the present invention is supported by 18 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H68654_1_T0 (SEQ ID NO:25), H68654_1_T15 (SEQ ID NO:29), H68654_1_T16 (SEQ ID NO:30), H68654_1_T17 (SEQ ID NO:31), H68654_1_T18 (SEQ ID NO:32), H68654_1_T19 (SEQ ID NO:33), H68654_1_T20 (SEQ ID NO:34), H68654_1_T4 (SEQ ID NO:26), H68654_1_T5 (SEQ ID NO:27) and H68654_1_T8 (SEQ ID NO:28). Table 90 below describes the starting and ending position of this segment on each transcript.

TABLE 90 Segment location on transcripts
Transcript name                 Segment starting position    Segment ending position
H68654_1_T0 (SEQ ID NO: 25)     1091                         1216
H68654_1_T15 (SEQ ID NO: 29)    1078                         1203
H68654_1_T16 (SEQ ID NO: 30)    1078                         1203
H68654_1_T17 (SEQ ID NO: 31)    1078                         1203
H68654_1_T18 (SEQ ID NO: 32)    1078                         1203
H68654_1_T19 (SEQ ID NO: 33)    1078                         1203
H68654_1_T20 (SEQ ID NO: 34)    1078                         1203
H68654_1_T4 (SEQ ID NO: 26)     1078                         1203
H68654_1_T5 (SEQ ID NO: 27)     1091                         1216
H68654_1_T8 (SEQ ID NO: 28)     1078                         1203

Expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg3WTF2R2 (SEQ ID NO: 226) in normal and cancerous Lung tissues and in different normal tissues

Expression of hypothetical protein LOC253012 transcripts detectable by or according to seg3F2R2-H68654_seg3WTF2R2 (SEQ ID NO: 226) amplicon and primers H68654_seg3WTF2 (SEQ ID NO: 224) and H68654_seg3WTR2 (SEQ ID NO: 225) was measured by real time PCR on lung panel and normal panel. The samples used are detailed in Table 3 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

Lung panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.
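The division described above can be sketched as follows, as a non-limiting illustration only. The sample numbers and quantities in the example are made up for demonstration and are not data from Table 3; the function name `fold_upregulation` is hypothetical.

```python
from statistics import median

def fold_upregulation(normalized, normal_ids):
    """Divide each normalized quantity by the median of the normal samples."""
    reference = median(normalized[i] for i in normal_ids)
    return {i: q / reference for i, q in normalized.items()}

# Made-up normalized quantities keyed by sample number (NOT data from Table 3).
example = {51: 1.0, 52: 2.0, 53: 3.0, 1: 160.0, 2: 8.0}
folds = fold_upregulation(example, normal_ids=[51, 52, 53])
print(folds[1], folds[2])  # fold values relative to the median normal sample
```

Using the median rather than the mean of the normal samples keeps the reference value from being dominated by a single outlying normal sample.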

FIG. 31 is a histogram showing over expression of the above-indicated hypothetical protein LOC253012 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 31, the expression of hypothetical protein LOC253012 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above). Notably an over-expression of at least 80 fold was found in 7 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below.

The P value for the difference in the expression levels of hypothetical protein LOC253012 transcripts detectable by the above amplicon in Lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 3.24e-003.

Threshold of 80 fold over expression was found to differentiate between small cell carcinoma and normal samples with P value of 7.49e-005 as checked by exact Fisher test.
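As a non-limiting illustration, an exact Fisher test of this kind can be sketched with the hypergeometric distribution. The text does not state how many normal samples reached the threshold; the sketch assumes that none of the 16 normal samples (sample numbers 51-64 and 69-70) did, and that a one-sided test was used. Both are assumptions, and the function name `fisher_exact_greater` is hypothetical.

```python
from math import comb

def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]]:
    probability of at least `a` in the top-left cell with all margins fixed."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    p = 0.0
    for x in range(a, min(row1, col1) + 1):
        p += comb(row1, x) * comb(row2, col1 - x) / total
    return p

# Rows: small cell carcinoma, normal; columns: at/above threshold, below.
# 7 of 9 small cell carcinoma samples (from the text); 0 of 16 normals (assumed).
p = fisher_exact_greater(7, 2, 0, 16)
print(f"{p:.2e}")
```

Under these assumptions the sketch gives 7.49e-05, which agrees with the P value reported above.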

The above values demonstrate statistical significance of the results.

Normal panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the lung samples (sample numbers 26, 28, 29 and 30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 32.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H68654_seg3WTF2 (SEQ ID NO: 224) forward primer; and H68654_seg3WTR2 (SEQ ID NO: 225) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H68654_seg3WTF2R2 (SEQ ID NO: 226).

Forward Primer >H68654_seg3WTF2 (SEQ ID NO: 224):
ATCACACACTGTCCATGGCGT
Reverse Primer >H68654_seg3WTR2 (SEQ ID NO: 225):
GTCTCTCAAATAGCCATATGATCTGG
Amplicon >H68654_seg3WTF2R2 (SEQ ID NO: 226):
ATCACACACTGTCCATGGCGTCAGAGGTCAGGCCCTC
TACCTACCCGTCCACTATGGCTTCCACACTCCAGCAT
CAGACATCCAGATCATATGGCTATTTGAGAGAC
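As a non-limiting illustration, the relationship between a primer pair and its amplicon can be checked computationally: the amplicon is expected to begin with the forward primer and to end with the reverse complement of the reverse primer. The sequences below are copied from the listing above; the function names are hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

FORWARD = "ATCACACACTGTCCATGGCGT"          # SEQ ID NO: 224
REVERSE = "GTCTCTCAAATAGCCATATGATCTGG"     # SEQ ID NO: 225
AMPLICON = (                               # SEQ ID NO: 226
    "ATCACACACTGTCCATGGCGTCAGAGGTCAGGCCCTC"
    "TACCTACCCGTCCACTATGGCTTCCACACTCCAGCAT"
    "CAGACATCCAGATCATATGGCTATTTGAGAGAC"
)

def primers_flank(amplicon, forward, reverse):
    """True when the amplicon starts with the forward primer and ends with
    the reverse complement of the reverse primer."""
    return amplicon.startswith(forward) and amplicon.endswith(
        reverse_complement(reverse)
    )

print(primers_flank(AMPLICON, FORWARD, REVERSE))
```

The same check can be applied to any other primer pair and amplicon listed herein by substituting the corresponding sequences.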

Expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg7-12WT (SEQ ID NO: 223) in normal and cancerous Lung tissues and in different normal tissues

Expression of hypothetical protein LOC253012 transcripts detectable by or according to seg7-12-H68654_seg7-12WT (SEQ ID NO: 223) amplicon and primers H68654_seg7-12WTF1 (SEQ ID NO: 221) and H68654_seg7-12WTR1 (SEQ ID NO: 222) was measured by real time PCR on lung panel and normal panel. The samples used are detailed in Table 3 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1.

Lung panel—Non-detected samples (samples no. 30, 41, 78 and 92, Table 3) were assigned Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 33 is a histogram showing over expression of the above-indicated hypothetical protein LOC253012 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 33, the expression of hypothetical protein LOC253012 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above) and was higher in a few squamous cell carcinoma samples. Notably an over-expression of at least 25 fold was found in 6 out of 9 small cell carcinoma samples and in 4 out of 24 squamous cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of hypothetical protein LOC253012 transcripts detectable by the above amplicon in Lung small cell carcinoma samples and Lung squamous cell carcinoma samples versus the normal tissue samples was determined by T test as 1.24e-002 and 2.97e-002, respectively.

Threshold of 25 fold over expression was found to differentiate between small cell carcinoma and normal samples with P value of 4.74e-004 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Normal panel—Non-detected sample (sample no. 16) was assigned Ct value of 41 and was calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the lung samples (sample numbers 26, 28, 29 and 30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 34.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H68654_seg7-12WTF1 (SEQ ID NO: 221) forward primer; and H68654_seg7-12WTR1 (SEQ ID NO: 222) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H68654_seg7-12WT (SEQ ID NO: 223).

Forward Primer >H68654_seg7-12WTF1 (SEQ ID NO: 221):
ATGGGCCTCGCTTAGAAGTTG
Reverse Primer >H68654_seg7-12WTR1 (SEQ ID NO: 222):
TTCTGTGCAAGCTTCTCCAGTC
Amplicon >H68654_seg7-12WT (SEQ ID NO: 223):
ATGGGCCTCGCTTAGAAGTTGCATCTGAGAAAGTAGCCCAGAAGACAATG
GACTATGTGTGCTGTGCTTACAACAACATAACCGGCAGGCAAGATGAAAC
TCATTTCACAGTTATCATCACTTCCGTAGGACTGGAGAAGCTTGCACAGA
A

Expression of LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg3WTF2R2 (SEQ ID NO: 226) in the blood-specific panel.

Expression of LOC253012 transcripts detectable by or according to seg3-H68654seg3F2R2 (SEQ ID NO: 226) amplicon and primers H68654seg3F2 (SEQ ID NO: 224) and H68654seg3R2 (SEQ ID NO: 225) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 35A. Expression of the above-indicated LOC253012 transcript is high in the kidney normal, colon normal and small intestine normal samples, as well as in a few of the different blood samples checked.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H68654_seg3WTF2 forward primer; and H68654_seg3WTR2 reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H68654_seg3WTF2R2.

Forward Primer >H68654_seg3WTF2 (SEQ ID NO: 224):
ATCACACACTGTCCATGGCGT
Reverse Primer >H68654_seg3WTR2 (SEQ ID NO: 225):
GTCTCTCAAATAGCCATATGATCTGG
Amplicon >H68654_seg3WTF2R2 (SEQ ID NO: 226):
ATCACACACTGTCCATGGCGTCAGAGGTCAGGCCCTCTACCTACCCGTCC
ACTATGGCTTCCACACTCCAGCATCAGACATCCAGATCATATGGCTATTT
GAGAGAC

Expression of LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654seg7-12 (SEQ ID NO: 223) in the blood-specific panel.

Expression of LOC253012 transcripts detectable by or according to seg7-12-H68654seg7-12WTF1R1 (SEQ ID NO: 223) amplicon and primers H68654seg7-12WTF1 (SEQ ID NO: 221) and H68654seg7-12WTR1 (SEQ ID NO: 222) was measured by real time PCR on blood panel. The samples used are detailed in Table 1 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 35B. Expression of the above-indicated LOC253012 transcript is higher in the kidney normal, colon normal and small intestine normal relative to the different blood samples checked.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: seg7-12WTF1 forward primer (SEQ ID NO: 221); and seg7-12WTR1 reverse primer (SEQ ID NO: 222).

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: seg7-12WTF1R1 (SEQ ID NO: 223).

Forward Primer >H68654_seg7-12WTF1 (SEQ ID NO: 221):
ATGGGCCTCGCTTAGAAGTTG
Reverse Primer >H68654_seg7-12WTR1 (SEQ ID NO: 222):
TTCTGTGCAAGCTTCTCCAGTC
Amplicon >H68654_seg7-12WT (SEQ ID NO: 223):
ATGGGCCTCGCTTAGAAGTTGCATCTGAGAAAGTAGCCCAGAAGACAATG
GACTATGTGTGCTGTGCTTACAACAACATAACCGGCAGGCAAGATGAAAC
TCATTTCACAGTTATCATCACTTCCGTAGGACTGGAGAAGCTTGCACAGA
A

Expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg0-3 (SEQ ID NO: 229) in normal and cancerous Lung tissues

Expression of LOC253012 transcripts detectable by or according to seg0-3-H68654_seg0-3 (SEQ ID NO: 229) amplicon and primers H68654_seg0-3F1 (SEQ ID NO: 227) and H68654_seg0-3R1 (SEQ ID NO: 228) was measured by real time PCR on lung panel. The samples used are detailed in Table 3 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-54, 56-64, 69 and 70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 36 is a histogram showing over expression of the above-indicated LOC253012 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 36, the expression of LOC253012 transcripts detectable by the above amplicon in squamous cell carcinoma and small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-54, 56-64, 69 and 70, Table 3 above). Notably an over-expression of at least 5 fold was found in 6 out of 21 squamous cell carcinoma samples and in 8 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of LOC253012 transcripts detectable by the above amplicon in lung squamous cell carcinoma samples and lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 4.96e-002 and 1.05e-003, respectively.

Threshold of 5 fold over expression was found to differentiate squamous cell carcinoma samples and small cell carcinoma samples from normal samples with P values of 2.79e-002 and 1.22e-005, respectively, as checked by exact Fisher test. The above values demonstrate statistical significance of the results.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H68654_seg0-3F1 (SEQ ID NO: 227) forward primer; and H68654_seg0-3R1 (SEQ ID NO: 228) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H68654_seg0-3F1R1 (SEQ ID NO: 229).

Forward Primer >H68654_seg0-3F1 (SEQ ID NO: 227):
GCTTTCATGGAGCCCTTCG
Reverse Primer >H68654_seg0-3R1 (SEQ ID NO: 228):
GCCTGACCTCTGACGCCA
Amplicon >H68654_seg0-3F1R1 (SEQ ID NO: 229):
GCTTTCATGGAGCCCTTCGGTGACACACTTGGGGTCTTTCAGTGCAAAAT
ATACCTCCTTCTCTTCGGTGCTTGCTCGGGGCTGAAGGTGACAGTGCCAT
CACACACTGTCCATGGCGTCAGAGGTCAGGC

Expression of hypothetical protein LOC253012 H68654 transcripts which are detectable by amplicon as depicted in sequence name H68654_seg2-3 (SEQ ID NO: 232) in normal and cancerous Lung tissues

Expression of LOC253012 transcripts detectable by or according to seg2-3-H68654_seg2-3 (SEQ ID NO: 232) amplicon and primers H68654_seg2-3F1 (SEQ ID NO: 230) and H68654_seg2-3R1 (SEQ ID NO: 231) was measured by real time PCR on lung panel. The samples used are detailed in Table 3 above. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several house keeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51, 52, 54-64, 69 and 70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 37 is a histogram showing over expression of the above-indicated LOC253012 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 37, the expression of LOC253012 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51, 52, 54-64, 69 and 70, Table 3 above). Notably an over-expression of at least 8 fold was found in 7 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of LOC253012 transcripts detectable by the above amplicon in Lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 1.93e-002.

Threshold of 8 fold over expression was found to differentiate between small cell carcinoma and normal samples with P value of 1.04e-004 as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H68654_seg2-3F1 (SEQ ID NO: 230) forward primer; and H68654_seg2-3R1 (SEQ ID NO: 231) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H68654_seg2-3F1R1 (SEQ ID NO: 232).

Forward Primer >H68654_seg2-3F1 (SEQ ID NO: 230):
CTCTGCATTTGCCCCTTTAGA
Reverse Primer >H68654_seg2-3R1 (SEQ ID NO: 231):
GATGGCACTGTCACCTTCAGC
Amplicon >H68654_seg2-3F1R1 (SEQ ID NO: 232):
CTCTGCATTTGCCCCTTTAGATTGTGAAATGTGGCTCAAGGTCTTCACAA
CTTTCCTTTCCTTTGCAACAGGTGCTTGCTCGGGGCTGAAGGTGACAGTG
CCATC

Example 6 Description for Cluster H19011_1

The present invention relates to C1ORF32 polypeptides, novel splice variants and diagnostics and therapeutics based thereon.

Cluster H19011_1 (internal ID 76432827) features 2 transcripts and 5 segments of interest, the names for which are given in Tables 91 and 92, respectively. The selected protein variants are given in Table 93.

TABLE 91 Transcripts of interest
Transcript Name
H19011_1_T8 (SEQ ID NO: 45)
H19011_1_T9 (SEQ ID NO: 46)

TABLE 92 Segments of interest
Segment Name
H19011_1_N13 (SEQ ID NO: 129)
H19011_1_N8 (SEQ ID NO: 130)
H19011_1_N10 (SEQ ID NO: 131)
H19011_1_N11 (SEQ ID NO: 132)
H19011_1_N12 (SEQ ID NO: 133)

TABLE 93 Proteins of interest
Protein Name                   Corresponding Transcripts
H19011_1_P8 (SEQ ID NO: 48)    H19011_1_T8 (SEQ ID NO: 45)
H19011_1_P9 (SEQ ID NO: 50)    H19011_1_T9 (SEQ ID NO: 46)

These sequences are variants of the known protein hypothetical protein LOC387597 (RefSeq accession identifier NP_955383 (SEQ ID NO: 47), synonyms: C1ORF32, NP_955383; LISCH-like; RP4-782G3.2; dJ782G3.1), referred to herein as the previously known protein.

C1ORF32 is a hypothetical protein that was computationally discovered during the annotation of chromosome 1 (Gregory S G et al. 2006, Nature 441 (7091) 315-321). Its closest annotated homolog belongs to the LISCH7 family, a subfamily of the immunoglobulin super family. One of the annotated members of this family is the lipolysis-stimulated lipoprotein receptor, which has a probable role in the clearance of triglyceride-rich lipoprotein from blood (Swissprot annotation of accession Q86X29).

According to the present invention, C1ORF32 was predicted to be a novel B7/CD28 member based on the presence of an IgV domain, in addition to its being a type I membrane protein, like other known B7 members. Also, two alternatively spliced variants of the present invention (H19011_1_P8 (SEQ ID NO:48) and H19011_1_P9 (SEQ ID NO:50)), which share only the first 5 exons with the wild type C1ORF32, are similar to the known B7 family members in their exons' sizes and the position of the IgV and transmembrane domains within these exons. In addition, C1ORF32 was shown in the present invention to be overexpressed in small cell lung cancer.

As noted above, cluster H19011 features 2 transcripts, which were listed in Table 91 above. These transcripts encode for proteins which are variants of protein hypothetical protein LOC387597 (SEQ ID NO:47). A description of each variant protein according to the present invention is now provided.

Variant protein H19011_1_P8 (SEQ ID NO:48) according to the present invention has an amino acid sequence as encoded by transcript H19011_1_T8 (SEQ ID NO:45). Alignments to one or more previously published protein sequences are shown in FIG. 38A. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

Comparison report between H19011_1_P8 (SEQ ID NO:48) and known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47) (FIG. 38A):

A. An isolated chimeric polypeptide encoding for H19011_1_P8 (SEQ ID NO:48), comprising a first amino acid sequence being at least 90% homologous to

MDRVLLRWISLFWLTAMVEGLQVTVPDKKKVAMLFQPTVLRCHFSTSSHQP AVVQWKFKSYCQDRMGESLGMSSTRAQSLSKRNLEWDPYLDCLDSRRTVRV VASKQGSTVTLGDFYRGREITIVHDADLQIGKLMWGDSGLYYCIITTPDDL EGKNE corresponding to amino acids 1-158 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 1-158 of H19011_1_P8 (SEQ ID NO:48), a bridging amino acid G corresponding to amino acid 159 of H19011_1_P8 (SEQ ID NO:48), a second amino acid sequence being at least 90% homologous to S corresponding to amino acids 160-160 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 160-160 of H19011_1_P8 (SEQ ID NO:48), bridging amino acids LG corresponding to amino acid 161-162 of H19011_1_P8 (SEQ ID NO:48), a third amino acid sequence being at least 90% homologous to

LLVLGRTGLLADLLPSFAVEIMPEWVFVGLVLLGVFLFFVLVGICWCQCCP HSCCCYVRCPCCPDSC corresponding to amino acids 163-229 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 163-229 of H19011_1_P8 (SEQ ID NO:48), a bridging amino acid W corresponding to amino acid 230 of H19011_1_P8 (SEQ ID NO:48), a fourth amino acid sequence being at least 90% homologous to CPQA corresponding to amino acids 231-234 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 231-234 of H19011_1_P8 (SEQ ID NO:48), and a fifth amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence CEYSDRWGDRAIERNVYLST (SEQ ID NO: 293) corresponding to amino acids 235-254 of H19011_1_P8 (SEQ ID NO:48), wherein said first amino acid sequence, bridging amino acid, second amino acid sequence, bridging amino acid, third amino acid sequence, bridging amino acid, fourth amino acid sequence and fifth amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of H19011_1_P8 (SEQ ID NO:48), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence CEYSDRWGDRAIERNVYLST (SEQ ID NO: 293) of H19011_1_P8 (SEQ ID NO:48).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H19011_1_P8 (SEQ ID NO:48) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 94, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:48)). An example of such a deduced sequence, with alternative amino-acids, that was produced (using part of the SNPs below), is given under the name H19011_1_P8_V1 (SEQ ID NO:49).

TABLE 94 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
159                                     G -> D
161                                     L -> V
162                                     G -> E
202                                     V -> D
202                                     V -> G
230                                     W -> C

The variant protein has the following domains, as determined by using InterPro. The domains are described in Table 95:

TABLE 95 InterPro domains
Domain description    Analysis type    Positions on protein
LISCH7                HMMPfam          186-234
IG                    SMART            27-166

Variant protein H19011_1_P8 (SEQ ID NO:48) is encoded by the transcript H19011_1_T8 (SEQ ID NO:45), for which the coding portion starts at position 181 and ends at position 942. The transcript also has the following SNPs as listed in Table 96 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 96 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
G -> A          656
C -> G          661
G -> A          665
T -> A          785
T -> G          785
G -> C          870

The genomic structure of protein H19011_1_P8 (SEQ ID NO:48) (number of exons relevant to the extra-cellular region of the protein, the length of these exons, the frame of the codon in which the introns are inserted and the location of the protein features and domains in the gene structure) is characteristic of the ligands of the B7/co-stimulatory protein family, as given in Table 97:

TABLE 97 Genomic structure and protein features
Exon number    Exon length    Amino acids    Protein feature on exon
1    46    1-15    Signal Peptide
2    333    16-126    IgV domain
3    120    127-166    IgC2 domain
4    57    167-185
5    206    186-254    Trans-membrane region
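The relationship between the exon lengths and the amino acid ranges in Table 97 can be sketched as follows. This Python example is illustrative only and rests on two assumptions that are not stated in the text: that the exon lengths count coding nucleotides only, and that a codon split by an intron is assigned to the downstream exon. The function name `exon_amino_acid_ranges` is hypothetical.

```python
# Illustrative sketch: derive amino acid ranges and codon frame from exon lengths.
# Assumptions (not stated in the source): lengths count coding nucleotides only,
# and a codon interrupted by an intron is assigned to the downstream exon.

exon_lengths = [46, 333, 120, 57, 206]  # exon lengths from Table 97


def exon_amino_acid_ranges(lengths):
    """Return (first_aa, last_aa, leftover_nt) for each exon.

    leftover_nt is the number of nucleotides of the next codon that are
    already present at the 3' end of the exon (0, 1 or 2).
    """
    ranges = []
    covered_nt = 0
    last_codon = 0
    for length in lengths:
        covered_nt += length
        complete_codons = covered_nt // 3
        leftover_nt = covered_nt % 3
        ranges.append((last_codon + 1, complete_codons, leftover_nt))
        last_codon = complete_codons
    return ranges


exon_ranges = exon_amino_acid_ranges(exon_lengths)
for number, (first, last, leftover) in enumerate(exon_ranges, start=1):
    print(number, f"{first}-{last}", leftover)

total_coding_nt = sum(exon_lengths)
print(total_coding_nt)
```

Under these assumptions the ranges agree with the amino acid column of Table 97, each of the first four exons ends one nucleotide into the next codon, and the summed exon length equals the length of the coding portion of H19011_1_T8 (positions 181 to 942).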

Variant protein H19011_1_P9 (SEQ ID NO:50) according to the present invention has an amino acid sequence as encoded by transcript H19011_1_T9 (SEQ ID NO:46). Alignments to one or more previously published protein sequences are shown in FIG. 38B. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

Comparison report between H19011_1_P9 (SEQ ID NO:50) and known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47) (FIG. 38B):

A. An isolated chimeric polypeptide encoding for H19011_1_P9 (SEQ ID NO:50), comprising a first amino acid sequence being at least 90% homologous to

MDRVLLRWISLFWLTAMVEGLQVTVPDKKKVAMLFQPTVLRCHFSTSSHQP AVVQWKFKSYCQDRMGESLGMSSTRAQSLSKRNLEWDPYLDCLDSRRTVRV VASKQGSTVTLGDFYRGREITIVHDADLQIGKLMWGDSGLYYCIITTPDDL EGKNE corresponding to amino acids 1-158 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 1-158 of H19011_1_P9 (SEQ ID NO:50), a bridging amino acid G corresponding to amino acid 159 of H19011_1_P9 (SEQ ID NO:50), a second amino acid sequence being at least 90% homologous to S corresponding to amino acids 160-160 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 160-160 of H19011_1_P9 (SEQ ID NO:50), bridging amino acids LG corresponding to amino acids 161-162 of H19011_1_P9 (SEQ ID NO:50), a third amino acid sequence being at least 90% homologous to LLVL corresponding to amino acids 163-166 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 163-166 of H19011_1_P9 (SEQ ID NO:50), a fourth amino acid sequence being at least 90% homologous to EWVFVGLVLLGVFLFFVLVGICWCQCCPHSCCCYVRCPCCPDSC corresponding to amino acids 186-229 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 167-210 of H19011_1_P9 (SEQ ID NO:50), a bridging amino acid W corresponding to amino acid 211 of H19011_1_P9 (SEQ ID NO:50), a fifth amino acid sequence being at least 90% homologous to CPQA corresponding to amino acids 231-234 of known proteins Q71H61_HUMAN and NP_955383 (SEQ ID NO: 47), which also corresponds to amino acids 212-215 of H19011_1_P9 (SEQ ID NO:50), and a sixth amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence CEYSDRWGDRAIERNVYLST (SEQ ID NO: 293) corresponding to amino acids 216-235 of H19011_1_P9 (SEQ ID NO:50), wherein said first amino acid sequence, bridging amino acid, second amino acid sequence, bridging amino acids, third amino acid sequence, fourth amino acid sequence, bridging amino acid, fifth amino acid sequence and sixth amino acid sequence are contiguous and in a sequential order.

B. An isolated chimeric polypeptide encoding for an edge portion of H19011_1_P9 (SEQ ID NO:50), comprising a polypeptide having a length “n”, wherein n is at least about 10 amino acids in length, optionally at least about 20 amino acids in length, preferably at least about 30 amino acids in length, more preferably at least about 40 amino acids in length and most preferably at least about 50 amino acids in length, wherein at least two amino acids comprise LE, having a structure as follows: a sequence starting from any of amino acid numbers 166−x to 166; and ending at any of amino acid numbers 167+((n−2)−x), in which x varies from 0 to n−2.
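The edge-portion definition in paragraph B can be read as the set of all windows of length n that contain both residue 166 (the L closing LLVL) and residue 167 (the E opening EWVF...). The following Python sketch enumerates those windows by position only. It is a non-limiting illustration; the function name `edge_windows` and its parameters are hypothetical.

```python
# Illustrative sketch: enumerate the (start, end) positions of edge portions
# of length n spanning the junction between amino acids 166 and 167.
# Formula from the text: start = 166 - x, end = 167 + ((n - 2) - x), x = 0 .. n - 2.


def edge_windows(n: int, left: int = 166, right: int = 167):
    """Return all 1-based (start, end) pairs of length n covering left and right."""
    if n < 2:
        raise ValueError("n must be at least 2 to cover both junction residues")
    windows = []
    for x in range(0, n - 1):  # x runs from 0 to n - 2 inclusive
        start = left - x
        end = right + ((n - 2) - x)
        windows.append((start, end))
    return windows


windows_10 = edge_windows(10)
for start, end in windows_10:
    print(start, end, end - start + 1)
```

For n = 10 this yields nine windows, from positions 166-175 (x = 0) to positions 158-167 (x = 8), each exactly 10 amino acids long.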

C. An isolated polypeptide encoding for an edge portion of H19011_1_P9 (SEQ ID NO:50), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence CEYSDRWGDRAIERNVYLST (SEQ ID NO: 293) of H19011_1_P9 (SEQ ID NO:50).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein H19011_1_P9 (SEQ ID NO:50) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 98, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:50)).

TABLE 98 Amino acid mutations
SNP position on amino acid sequence    Alternative amino acid
159    G -> D
161    L -> V
162    G -> E
183    V -> D
183    V -> G
211    W -> C

Variant protein H19011_1_P9 (SEQ ID NO:50) is encoded by the transcript H19011_1_T9 (SEQ ID NO:46), for which the coding portion starts at position 181 and ends at position 885. The transcript also has the following SNPs as listed in Table 99 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 99 Nucleic acid SNPs
Polymorphism    SNP position on nucleotide sequence
G -> A    656
C -> G    661
G -> A    665
T -> A    728
T -> G    728
G -> C    813

As noted above, cluster H19011 features 5 segments, which were listed in Table 92 above. These segments are portions of nucleic acid sequences which are described herein separately because they are of particular interest. A description of each segment according to the present invention is now provided.

Segment cluster H19011_1_N13 (SEQ ID NO:129) according to the present invention is supported by 3 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H19011_1_T8 (SEQ ID NO:45) and H19011_1_T9 (SEQ ID NO:46). Table 100 below describes the starting and ending position of this segment on each transcript.

TABLE 100 Segment location on transcripts
Transcript name    Segment starting position    Segment ending position
H19011_1_T8 (SEQ ID NO: 45)    884    1407
H19011_1_T9 (SEQ ID NO: 46)    827    1350

According to an optional embodiment of the present invention, short segments related to the above cluster are also provided. These segments are up to about 120 bp in length, and so are included in a separate description.

Segment cluster H19011_1_N8 (SEQ ID NO:130) according to the present invention is supported by 4 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcript: H19011_1_T8 (SEQ ID NO:45). Table 101 below describes the starting and ending position of this segment on the transcript.

TABLE 101 Segment location on transcripts
Transcript name    Segment starting position    Segment ending position
H19011_1_T8 (SEQ ID NO: 45)    680    736

Segment cluster H19011_1_N10 (SEQ ID NO:131) according to the present invention is supported by 5 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H19011_1_T8 (SEQ ID NO:45) and H19011_1_T9 (SEQ ID NO:46). Table 102 below describes the starting and ending position of this segment on each transcript.

TABLE 102 Segment location on transcripts
Transcript name    Segment starting position    Segment ending position
H19011_1_T8 (SEQ ID NO: 45)    737    797
H19011_1_T9 (SEQ ID NO: 46)    680    740

Segment cluster H19011_1_N11 (SEQ ID NO:132) according to the present invention is supported by 3 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H19011_1_T8 (SEQ ID NO:45) and H19011_1_T9 (SEQ ID NO:46). Table 103 below describes the starting and ending position of this segment on each transcript.

TABLE 103 Segment location on transcripts
Transcript name    Segment starting position    Segment ending position
H19011_1_T8 (SEQ ID NO: 45)    798    863
H19011_1_T9 (SEQ ID NO: 46)    741    806

Segment cluster H19011_1_N12 (SEQ ID NO:133) according to the present invention is supported by 5 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: H19011_1_T8 (SEQ ID NO:45) and H19011_1_T9 (SEQ ID NO:46). Table 104 below describes the starting and ending position of this segment on each transcript.

TABLE 104 Segment location on transcripts
Transcript name    Segment starting position    Segment ending position
H19011_1_T8 (SEQ ID NO: 45)    864    883
H19011_1_T9 (SEQ ID NO: 46)    807    826
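The segment coordinates in Tables 100 to 104 can be compared between the two transcripts. The following Python sketch is illustrative only; the dictionary names and the helper `span_length` are hypothetical, and the coordinates are copied from the tables above.

```python
# Illustrative sketch: compare segment coordinates on H19011_1_T8 and H19011_1_T9.
# Coordinates are 1-based, inclusive (start, end) pairs taken from Tables 100-104.

segments_t8 = {
    "N8": (680, 736),
    "N10": (737, 797),
    "N11": (798, 863),
    "N12": (864, 883),
    "N13": (884, 1407),
}
segments_t9 = {
    "N10": (680, 740),
    "N11": (741, 806),
    "N12": (807, 826),
    "N13": (827, 1350),
}


def span_length(span):
    """Length in nucleotides of an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


n8_length = span_length(segments_t8["N8"])

# Shift of each shared segment between the two transcripts.
offsets = {name: segments_t8[name][0] - segments_t9[name][0] for name in segments_t9}

# Each shared segment should have the same length on both transcripts.
same_length = all(
    span_length(segments_t8[name]) == span_length(segments_t9[name])
    for name in segments_t9
)
print(n8_length, offsets, same_length)
```

Every shared segment is shifted by the same 57 nucleotides, which equals the length of segment H19011_1_N8 that Table 101 lists for H19011_1_T8 only. The same difference separates the ends of the two coding portions (942 versus 885).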

Expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg13F2R2 (SEQ ID NO: 235) in normal and cancerous Colon tissues, in normal and cancerous Lung tissues and in different normal tissues

Expression of C1ORF32, chromosome 1 open reading frame 32, transcripts detectable by or according to seg13-H19011_seg13F2R2 (SEQ ID NO: 235) amplicon and primers H19011_seg13F2 (SEQ ID NO: 233) and H19011_seg13R2 (SEQ ID NO: 234) was measured by real time PCR on the colon panel, lung panel and normal panel. The samples used are detailed in Table 5, Table 3 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

Colon panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.
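The fold calculation described for the panels can be sketched as follows. This Python example is a non-limiting illustration: the function name `fold_change` is hypothetical, and the input quantities are invented placeholder numbers, not measured values from any panel.

```python
# Illustrative sketch: fold up- or down-regulation relative to the median of the normals.
# The quantities below are invented placeholders, not data from the panels.
from statistics import median


def fold_change(sample_quantities, normal_quantities, down=False):
    """Divide each normalized quantity by the median of the normal samples.

    With down=True the reciprocal of each ratio is returned, giving fold
    down-regulation. A quantity of zero would need separate handling.
    """
    reference = median(normal_quantities)
    ratios = [q / reference for q in sample_quantities]
    if down:
        return [1.0 / r for r in ratios]
    return ratios


normal_quantities = [2.0, 4.0, 8.0]   # hypothetical normal samples, median 4.0
cancer_quantities = [0.5, 1.0, 4.0]   # hypothetical cancer samples

fold_up = fold_change(cancer_quantities, normal_quantities)
fold_down = fold_change(cancer_quantities, normal_quantities, down=True)
print(fold_up)
print(fold_down)
```

With these placeholder numbers, a sample at one eighth of the normal median is reported as 8 fold down-regulated, and a sample equal to the median is reported as 1 fold.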

FIG. 39 is a histogram showing down regulation of the above-indicated C1ORF32 transcripts in cancerous Colon samples relative to the normal samples.

As is evident from FIG. 39, the expression of C1ORF32 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-70, Table 5 above). Notably, down regulation of at least 6 fold was found in 17 out of 55 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of C1ORF32 transcripts detectable by the above amplicon in Colon cancer samples versus the normal tissue samples was determined by T test as 9.36e-004.

A threshold of 6 fold down regulation was found to differentiate between cancer and normal samples with a P value of 2.67e-004, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

Lung panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 40 is a histogram showing over expression of the above-indicated C1ORF32 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 40, the expression of C1ORF32 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above). Notably, over-expression of at least 6 fold was found in 9 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below.

The P value for the difference in the expression levels of C1ORF32 transcripts detectable by the above amplicon in Lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 3.43e-003.

A threshold of 6 fold over expression was found to differentiate between small cell carcinoma and normal samples with a P value of 4.89e-007, as checked by the exact Fisher test.
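An exact Fisher test on a threshold of this kind operates on a 2x2 table of samples above and below the threshold. The following Python sketch computes a one-sided exact P value from the hypergeometric distribution using only the standard library. It is illustrative only: the function name `fisher_exact_greater` is hypothetical, and the counts for the normal samples are an assumption, since the text states that 9 out of 9 small cell carcinoma samples exceeded the threshold but does not state how many of the 16 normal samples (numbers 51-64 and 69-70) did. The sketch assumes none did.

```python
# Illustrative sketch: one-sided exact Fisher test on a 2x2 contingency table.
# Assumption (not stated in the source): none of the 16 normal samples
# reached the 6 fold threshold.
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """P value for observing at least `a` in the top-left cell.

    Table layout:            above threshold   below threshold
        cancer samples             a                 b
        normal samples             c                 d
    Row and column totals are held fixed (hypergeometric model).
    """
    row1 = a + b
    row2 = c + d
    col1 = a + c
    denominator = comb(row1 + row2, col1)
    p = 0.0
    for k in range(a, min(row1, col1) + 1):
        p += comb(row1, k) * comb(row2, col1 - k) / denominator
    return p


# 9 of 9 small cell carcinoma samples above threshold; 0 of 16 normals (assumed).
p_value = fisher_exact_greater(9, 0, 0, 16)
print(f"{p_value:.2e}")
```

Under the stated assumption the result is 1/C(25,9), approximately 4.89e-07.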

The above values demonstrate statistical significance of the results.

Normal panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the colon samples (sample numbers 3, 4 and 5, Table 2 above), to obtain a value of relative expression of each sample relative to median of the colon samples, as shown in FIG. 41A. The normalized quantity of each RT sample was then divided by the median of the quantities of the lung samples (sample numbers 26, 28, 29 and 30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 41B.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H19011_seg13F2 (SEQ ID NO: 233) forward primer; and H19011_seg13R2 (SEQ ID NO: 234) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H19011_seg13F2R2 (SEQ ID NO: 235).

Forward Primer >H19011_seg13F2 (SEQ ID NO: 233):
GTGAGTACAGTGACCGCTGGG
Reverse Primer >H19011_seg13R2 (SEQ ID NO: 234):
GGAGAAGAGTCTGGAATGACCAA
Amplicon >H19011_seg13F2R2 (SEQ ID NO: 235):
GTGAGTACAGTGACCGCTGGGGAGACAGAGCGATCGAGAGAAATGTCTAC
CTCTCTACCTGACAGCTGTGTGCGCTGGGTTCCTCCTCCACCTCCTGTCC
TGCCACCCCCAAGATTGGTCATTCCAGACTCTTCTCC
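The consistency of a primer pair with its amplicon can be checked by confirming that the amplicon begins with the forward primer and ends with the reverse complement of the reverse primer. The following Python sketch performs this check for the sequences listed above. It is a non-limiting illustration; the helper name `reverse_complement` is hypothetical.

```python
# Illustrative sketch: check that an amplicon is bounded by its primer pair.
# Sequences are copied from the listing above (SEQ ID NOs: 233, 234 and 235).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


forward_primer = "GTGAGTACAGTGACCGCTGGG"
reverse_primer = "GGAGAAGAGTCTGGAATGACCAA"
amplicon = (
    "GTGAGTACAGTGACCGCTGGGGAGACAGAGCGATCGAGAGAAATGTCTAC"
    "CTCTCTACCTGACAGCTGTGTGCGCTGGGTTCCTCCTCCACCTCCTGTCC"
    "TGCCACCCCCAAGATTGGTCATTCCAGACTCTTCTCC"
)

starts_with_forward = amplicon.startswith(forward_primer)
ends_with_reverse = amplicon.endswith(reverse_complement(reverse_primer))
print(starts_with_forward, ends_with_reverse)
```

Both checks hold for this primer pair and amplicon.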

Expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_seg8-13F1R1 (SEQ ID NO: 238) in normal and cancerous Lung tissues

Expression of C1ORF32, chromosome 1 open reading frame 32, transcripts detectable by or according to seg8-13F1R1-H19011_seg8-13F1R1 (SEQ ID NO: 238) amplicon and primers H19011_seg8-13F1 (SEQ ID NO: 236) and H19011_seg8-13R1 (SEQ ID NO: 237) was measured by real time PCR on the lung panel. The samples used are detailed in Table 3 above. Samples that showed no detection of the amplicon (samples no. 1, 2, 4-10, 12-27, 29-35, 37-41, 51-64 and 69-70, Table 3) were assigned a Ct value of 41 and were calculated accordingly. These samples showed a primer-dimer product with a characteristic dissociation curve and a significantly lower Tm (this artefactual product was identified by its appearance in the negative control without RT sample). For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64 and 69-70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to the median of the normal samples.

FIG. 42 is a histogram showing over expression of the above-indicated C1ORF32 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 42, the expression of C1ORF32 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51-64 and 69-70, Table 3 above). Notably, over-expression of at least 500 fold was found in 9 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of C1ORF32 transcripts detectable by the above amplicon in Lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 6.70e-003.

A threshold of 500 fold over expression was found to differentiate between small cell carcinoma and normal samples with a P value of 4.89e-007, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H19011_seg8-13F1 (SEQ ID NO: 236) forward primer; and H19011_seg8-13R1 (SEQ ID NO: 237) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H19011_seg8-13F1R1 (SEQ ID NO: 238).

Forward Primer >H19011_seg8-13F1 (SEQ ID NO: 236):
GCCCAGTTTTGCTGTGGAGA
Reverse Primer >H19011_seg8-13R1 (SEQ ID NO: 237):
GGTAGACATTTCTCTCGATCGCTC
Amplicon >H19011_seg8-13F1R1 (SEQ ID NO: 238):
GCCCAGTTTTGCTGTGGAGATTATGCCAGAGTGGGTGTTTGTTGGCCTGG
TGCTCCTGGGCGTCTTCCTCTTCTTCGTCCTGGTGGGGATCTGCTGGTGC
CAGTGCTGCCCTCACAGCTGCTGCTGCTATGTCCGCTGCCCATGCTGCCC
AGATTCCTGCTGGTGCCCTCAAGCCTGTGAGTACAGTGACCGCTGGGGAG
ACAGAGCGATCGAGAGAAATGTCTACC

Expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc8-10seg13 (SEQ ID NO: 241) in normal and cancerous lung tissues, in normal and cancerous colon tissues, in different normal tissues and in the blood-specific panel.

Expression of C1ORF32 transcripts detectable by or according to junc8-10seg13-H19011_junc8-10seg13 (SEQ ID NO: 241) amplicon and primers H19011_junc8-10seg13F1 (SEQ ID NO: 239) and H19011_junc8-10seg13R1 (SEQ ID NO: 240) was measured by real time PCR on the lung panel, colon panel, normal panel and blood panel. The samples used are detailed in Table 3, Table 5, Table 2 and Table 1 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

For lung panel—Non-detected sample (sample no. 69, Table 3) was assigned Ct value of 41 and was calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51, 53, 54, 56, 57, 59, 61, 62, 64 and 70, Table 3 above), to obtain a value of fold up-regulation for each sample relative to median of the normal samples.

FIG. 43 is a histogram showing over expression of the above-indicated C1ORF32 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 43, the expression of C1ORF32 transcripts detectable by the above amplicon in small cell carcinoma samples was significantly higher than in the non-cancerous samples (sample numbers 51, 53, 54, 56, 57, 59, 61, 62, 64 and 70, Table 3 above). Notably, over-expression of at least 7 fold was found in 9 out of 9 small cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below.

The P value for the difference in the expression levels of C1ORF32 transcripts detectable by the above amplicon in Lung small cell carcinoma samples versus the normal tissue samples was determined by T test as 2.34e-003.

A threshold of 7 fold over expression was found to differentiate between small cell carcinoma and normal samples with a P value of 1.08e-005, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

For colon panel—Non-detected sample (sample no. 79, Table 5) was assigned Ct value of 41 and was calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-62 and 65-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.

FIG. 44 is a histogram showing down regulation of the above-indicated C1ORF32 transcripts in cancerous colon samples relative to the normal samples.

As is evident from FIG. 44, the expression of C1ORF32 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-62 and 65-70, Table 5 above). Notably, down regulation of at least 5 fold was found in 15 out of 36 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. A threshold of 5 fold down regulation was found to differentiate between cancer and normal samples with a P value of 4.29e-004, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

For normal panel—Non-detected samples (samples no. 42 and 49, Table 2) were assigned Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the colon samples (sample numbers 4 and 5, Table 2 above), to obtain a value of relative expression of each sample relative to median of the colon samples, as shown in FIG. 45A. The normalized quantity of each RT sample was then divided by the median of the quantities of the lung samples (sample numbers 26, 29 and 30, Table 2 above), to obtain a value of relative expression of each sample relative to median of the lung samples, as shown in FIG. 45B.

For blood panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the kidney normal samples (sample numbers 65-67, Table 1 above), to obtain a value of relative expression of each sample relative to median of the kidney normal samples.

The results of this analysis are depicted in the histogram in FIG. 46. Expression of the above-indicated C1ORF32 transcript is high in one lymphoma sample (sample no. 33, Table 1), but also in the normal brain sample.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H19011_junc8-10seg13F1 (SEQ ID NO: 239) forward primer; and H19011_junc8-10seg13R1 (SEQ ID NO: 240) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H19011_junc8-10seg13F1R1 (SEQ ID NO: 241).

Forward Primer >H19011_junc8-10seg13F1 (SEQ ID NO: 239):
TGTGGAGATTATGCCAGAGTGG
Reverse Primer >H19011_junc8-10seg13R1 (SEQ ID NO: 240):
GACATTTCTCTCGATCGCTCTGT
Amplicon >H19011_junc8-10seg13F1R1 (SEQ ID NO: 241):
TGTGGAGATTATGCCAGAGTGGGTGTTTGTTGGCCTGGTGCTCCTGGGCG
TCTTCCTCTTCTTCGTCCTGGTGGGGATCTGCTGGTGCCAGTGCTGCCCT
CACAGCTGCTGCTGCTATGTCCGCTGCCCATGCTGCCCAGATTCCTGCTG
GTGCCCTCAAGCCTGTGAGTACAGTGACCGCTGGGGAGACAGAGCGATCG
AGAGAAATGTC

Expression of C1ORF32, chromosome 1 open reading frame 32, H19011 transcripts which are detectable by amplicon as depicted in sequence name H19011_junc6-10 (SEQ ID NO: 244) in normal and cancerous lung tissues and in normal and cancerous Colon tissues

Expression of C1ORF32 transcripts detectable by or according to junc6-10-H19011_junc6-10F1R1 (SEQ ID NO: 244) amplicon and primers H19011_junc6-10F1 (SEQ ID NO: 242) and H19011_junc6-10R1 (SEQ ID NO: 243) was measured by real time PCR on the lung panel and colon panel. The samples used are detailed in Table 3 and Table 5 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

Lung panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 51-64, 69 and 70, Table 3 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.

FIG. 47 is a histogram showing down regulation of the above-indicated C1ORF32 transcripts in cancerous Lung samples relative to the normal samples.

As is evident from FIG. 47, the expression of C1ORF32 transcripts detectable by the above amplicon in non-small cell carcinoma samples, including adenocarcinoma and squamous cell carcinoma samples, was significantly lower than in the non-cancerous samples (sample numbers 51-64, 69 and 70, Table 3 above). Notably, down regulation of at least 5 fold was found in 23 out of 39 non-small cell carcinoma samples, especially in 8 out of 17 adenocarcinoma samples and in 12 out of 16 squamous cell carcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of C1ORF32 transcripts detectable by the above amplicon in lung non-small cell carcinoma, lung adenocarcinoma and lung squamous cell carcinoma samples versus the normal tissue samples was determined by T test as 1.18e-003, 2.87e-002 and 3.55e-004, respectively.

A threshold of 5 fold down regulation was found to differentiate between lung non-small cell carcinoma, lung adenocarcinoma and lung squamous cell carcinoma samples and normal samples with P values of 1.59e-003, 3.54e-002 and 4.78e-004, respectively, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

Colon panel—The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 42-70, Table 5 above). Then the reciprocal of this ratio was calculated, to obtain a value of fold down-regulation for each sample relative to median of the normal samples.

FIG. 48 is a histogram showing down regulation of the above-indicated C1ORF32 transcripts in cancerous Colon samples relative to the normal samples.

As is evident from FIG. 48, the expression of C1ORF32 transcripts detectable by the above amplicon in cancer samples was significantly lower than in the non-cancerous samples (sample numbers 42-70, Table 5 above). Notably, down regulation of at least 9 fold was found in 23 out of 55 adenocarcinoma samples.

Statistical analysis was applied to verify the significance of these results, as described below.

A threshold of 9 fold down regulation was found to differentiate between cancer and normal samples with a P value of 7.39e-006, as checked by the exact Fisher test.

The above values demonstrate statistical significance of the results.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: H19011_junc6-10F1 (SEQ ID NO: 242) forward primer; and H19011_junc6-10R1 (SEQ ID NO: 243) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: H19011_junc6-10F1R1 (SEQ ID NO: 244).

Forward Primer >H19011_junc6-10F1 (SEQ ID NO: 242):
ACTCTATTACTGTATTATCACCACCCCAG
Reverse Primer >H19011_junc6-10R1 (SEQ ID NO: 243):
CCAACAAACACCCACTCCAAC
Amplicon >H19011_junc6-10F1R1 (SEQ ID NO: 244):
ACTCTATTACTGTATTATCACCACCCCAGATGACCTGGAGGGGAAAAA
TGAGGGCTCACTGGGACTGCTGGTGTTGGAGTGGGTGTTTGTTGG

Example 7 Description for Cluster R31375

The present invention relates to a specific antigen, FXYD3, and to related diagnostics and therapeutics based thereon.

According to the present invention, Cluster R31375 (internal ID 72360301) features 19 transcripts and 4 segments of interest, the names for which are given in Tables 105 and 106, respectively. The selected protein variants are given in Table 107.

TABLE 105 Transcripts of interest
Transcript Name
R31375_T0 (SEQ ID NO: 51)
R31375_T1 (SEQ ID NO: 52)
R31375_T2 (SEQ ID NO: 53)
R31375_T3 (SEQ ID NO: 54)
R31375_T4 (SEQ ID NO: 55)
R31375_T5 (SEQ ID NO: 56)
R31375_T6 (SEQ ID NO: 57)
R31375_T7 (SEQ ID NO: 58)
R31375_T8 (SEQ ID NO: 59)
R31375_T9 (SEQ ID NO: 60)
R31375_T10 (SEQ ID NO: 61)
R31375_T11 (SEQ ID NO: 62)
R31375_T12 (SEQ ID NO: 63)
R31375_T13 (SEQ ID NO: 64)
R31375_T19 (SEQ ID NO: 65)
R31375_T25 (SEQ ID NO: 66)
R31375_T26 (SEQ ID NO: 67)
R31375_T29 (SEQ ID NO: 68)
R31375_T39 (SEQ ID NO: 69)

TABLE 106 Segments of interest
Segment Name
R31375_N30 (SEQ ID NO: 134)
R31375_N33 (SEQ ID NO: 135)
R31375_N34 (SEQ ID NO: 136)
R31375_N37 (SEQ ID NO: 137)

TABLE 107 Proteins of interest
Protein Name    Corresponding Transcripts
R31375_P0 (SEQ ID NO: 70)    R31375_T0 (SEQ ID NO: 51); R31375_T1 (SEQ ID NO: 52); R31375_T10 (SEQ ID NO: 61); R31375_T11 (SEQ ID NO: 62); R31375_T12 (SEQ ID NO: 63); R31375_T13 (SEQ ID NO: 64); R31375_T2 (SEQ ID NO: 53); R31375_T3 (SEQ ID NO: 54); R31375_T4 (SEQ ID NO: 55); R31375_T5 (SEQ ID NO: 56); R31375_T6 (SEQ ID NO: 57); R31375_T7 (SEQ ID NO: 58); R31375_T8 (SEQ ID NO: 59); R31375_T9 (SEQ ID NO: 60)
R31375_P14 (SEQ ID NO: 72)    R31375_T19 (SEQ ID NO: 65); R31375_T25 (SEQ ID NO: 66); R31375_T26 (SEQ ID NO: 67)
R31375_P31 (SEQ ID NO: 73)    R31375_T29 (SEQ ID NO: 68)
R31375_P33 (SEQ ID NO: 74)    R31375_T39 (SEQ ID NO: 69)

These sequences are variants of the known protein FXYD domain-containing ion transport regulator 3 precursor (SwissProt accession identifier FXYD3_HUMAN (SEQ ID NO: 70); known also according to the synonyms Chloride conductance inducer protein Mat-8; Mammary tumor 8 kDa protein; Phospholemman-like), referred to herein as the previously known protein.

FXYD3 was previously identified within a set of genes induced by the neu or Ha-Ras oncogenes in murine breast tumors, and was named Mat-8 (Mammary tumor, 8 kDa) (Morrison et al 1995). In normal tissues, FXYD3 is mainly expressed in the uterus, stomach, colon and skin (Morrison et al 1995). Its expression was found elevated in human primary breast tumors, as well as prostate carcinoma and pancreatic ductal adenocarcinoma (Grzmil et al 2004, Kayed et al 2006). Specific inhibition of its expression by siRNA in prostate cancer cell lines indicates a role in cellular proliferation (Grzmil et al 2004).

FXYD3 belongs to the FXYD family of proteins. The seven known members of this family are all small membrane proteins that contain a common signature of 6 amino acids comprising the FXYD motif. Most FXYD proteins, including human FXYD3, are type I membrane proteins, containing a transmembrane domain and a cleavable signal peptide. However, the signal peptide of mouse FXYD3 is not cleaved and may act as a second transmembrane domain (Crambert et al 2005, Geering 2006). FXYD3, like other members of the FXYD family, interacts with Na/K-ATPase and modulates its activity in a tissue-specific manner (Crambert et al 2005, Arimochi et al 2007, Geering 2006).

Two splice variant isoforms of FXYD3 have been previously described (Bibert et al 2006). These differ by a 26 amino acid in-frame insertion after the transmembrane domain, and are differentially expressed during cell differentiation. Furthermore, both isoforms are able to stably associate with Na/K-ATPase and play different functional roles in the regulation of the activities of this ATPase (Bibert et al 2006).

In addition, WO 2003101283 may be relevant to the present invention. This PCT application purports to disclose that R31375_P0 (SEQ ID NO:70) (wild-type FXYD3 nucleic acid coding sequence reported herein) is a differentially expressed nucleic acid which encodes a protein sequence that allegedly may be used as a diagnostic marker for human lung cancer.

Further, WO2003000012 purports to disclose a human breast cancer related protein referred to as protein #12 that seems to correspond to the R31375_P0 (wild-type) disclosed herein. Also, U.S. Pat. No. 7,189,507 discloses a gene referred to as MAT8 in a long table of gene sequences that seems to correspond to R31375_P0 (SEQ ID NO:70). The table seems to suggest that this gene may be expressed in ovarian cancer.

Protein FXYD domain-containing ion transport regulator 3 precursor (SEQ ID NO:70) is known or believed to have the following functions: Induces a hyperpolarization-activated chloride current when expressed in Xenopus oocytes. May be a modulator capable of activating endogenous oocyte channels. Known polymorphisms for this sequence are as shown in Table 108.

TABLE 108 Amino acid mutations for Known Protein
SNP positions on amino acid sequence    Comment
36-37                                   Missing
58                                      S → SEWRSSGEQAGRGWGSPPLTTQLSPTG

Protein FXYD domain-containing ion transport regulator 3 precursor (SEQ ID NO:70) localization is believed to be Membrane; single-pass type I membrane protein (Potential).

The following GO Annotations apply to the previously known protein. The following annotations were found: chloride transport, which are annotations related to Biological Process; chloride channel activity, which are annotations related to Molecular Function; and integral to plasma membrane, which are annotations related to Cellular Component.

The GO assignment relies on information from one or more of the SwissProt/TremBl Protein knowledgebase, available from <http://www.expasy.ch/sprot/>; or Locuslink, available from <http://www.ncbi.nlm.nih.gov/projects/LocusLink/>.

According to the present invention, novel FXYD3 splice variants were identified as membrane bound proteins that are predicted to be overexpressed in cancers. FXYD3 is a type I membrane bound protein (Bibert et al 2006) and according to the present invention 3 novel splice variants of this protein are provided. The novel splice variant referred to herein as R31375_P14 (SEQ ID NO:72) has an additional in-frame exon in its extracellular region. The addition of this exon increases the length of the extracellular region by 29 amino acids (amino acids 33-61 of R31375_P14 (SEQ ID NO:72)), which is a significant addition to the rather short extracellular domain of 18 amino acids of the wild type protein. The novel splice variant referred to herein as R31375_P31 (SEQ ID NO:73) has an additional in-frame exon in the juxtamembrane domain of FXYD3, which adds 26 new amino acids to the intracellular region. The novel splice variant referred to herein as R31375_P33 (SEQ ID NO:74) skips the 3rd coding exon of the wild type FXYD3 and, like R31375_P31 (SEQ ID NO:73), has the additional in-frame exon in the juxtamembrane domain. Skipping the 3rd coding exon deletes 8 amino acids from the ectodomain.

According to the present invention FXYD3 and R31375_P14 (SEQ ID NO:72) were shown to be overexpressed in ovarian cancer.

As noted above, cluster R31375 features 19 transcripts, which were listed in Table 105 above. These transcripts encode for proteins which are variants of protein FXYD domain-containing ion transport regulator 3 precursor (SEQ ID NO:70). A description of each variant protein according to the present invention is now provided.

Variant protein R31375_P0 (SEQ ID NO:70) according to the present invention has an amino acid sequence encoded by transcripts R31375_T0 (SEQ ID NO:51), R31375_T1 (SEQ ID NO:52), R31375_T10 (SEQ ID NO:61), R31375_T11 (SEQ ID NO:62), R31375_T12 (SEQ ID NO:63), R31375_T13 (SEQ ID NO:64), R31375_T2 (SEQ ID NO:53), R31375_T3 (SEQ ID NO:54), R31375_T4 (SEQ ID NO:55), R31375_T5 (SEQ ID NO:56), R31375_T6 (SEQ ID NO:57), R31375_T7 (SEQ ID NO:58), R31375_T8 (SEQ ID NO:59) and R31375_T9 (SEQ ID NO:60).

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein R31375_P0 (SEQ ID NO:70) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 109, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:70)).

TABLE 109 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
3                                       K -> R
19                                      D -> A
19                                      D -> V
50                                      A -> P
75                                      E ->

The coding portion of transcript R31375_T0 (SEQ ID NO:51) starts at position 491 and ends at position 751. The transcript also has the following SNPs as listed in Table 110 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 110 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          498, 900, 1571
A -> T          546
A -> C          546
T -> A          607, 926
T -> G          607
G -> C          638
G ->            713
C -> T          792
G -> A          901, 1572
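By way of non-limiting illustration only, the nucleotide positions of Table 110 can be related to the amino acid positions of Table 109 by counting codons from the stated start of the coding portion of R31375_T0 (position 491). The short sketch below does this arithmetic; the function name `codon_of` and its return convention are hypothetical conveniences introduced here and do not form part of the analysis tools referred to in this description.

```python
# Coding portion of transcript R31375_T0 (1-based, inclusive), as stated in the text.
CDS_START = 491
CDS_END = 751


def codon_of(nt_pos, cds_start=CDS_START, cds_end=CDS_END):
    """Return (codon number, base within codon), both 1-based.

    Returns None when the position lies outside the coding portion.
    Hypothetical helper for illustration only.
    """
    if not cds_start <= nt_pos <= cds_end:
        return None
    offset = nt_pos - cds_start
    return offset // 3 + 1, offset % 3 + 1


# A selection of SNP positions on R31375_T0 taken from Table 110.
for pos in (469, 498, 546, 607, 638, 713, 792):
    print(pos, codon_of(pos))
```

Positions 498, 546, 638 and 713 fall in codons 3, 19, 50 and 75 respectively, which are the amino acid positions listed in Table 109, whereas positions 469 and 792 lie outside the coding portion.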

The coding portion of transcript R31375_T1 (SEQ ID NO:52) starts at position 795 and ends at position 1055. The transcript also has the following SNPs as listed in Table 111 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 111 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
G -> A          513, 1205, 1876
C -> A          773
A -> G          802, 1204, 1875
A -> T          850
A -> C          850
T -> A          911, 1230
T -> G          911
G -> C          942
G ->            1017
C -> T          1096

The coding portion of transcript R31375_T10 (SEQ ID NO:61) starts at position 1826 and ends at position 2086. The transcript also has the following SNPs as listed in Table 112 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 112 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          639, 984, 1504, 1833, 2235, 2906
A -> C          860, 1183, 1447, 1504, 1881
C ->            1063
A -> T          1183, 1447, 1881
T -> A          1942, 2261
T -> G          1942
G -> C          1973
G ->            2048
C -> T          2127
G -> A          2236, 2907

The coding portion of transcript R31375_T11 (SEQ ID NO:62) starts at position 613 and ends at position 873. The transcript also has the following SNPs as listed in Table 113 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 113 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          620, 1022, 1693
A -> T          668
A -> C          668
T -> A          729, 1048
T -> G          729
G -> C          760
G ->            835
C -> T          914
G -> A          1023, 1694

The coding portion of transcript R31375_T12 (SEQ ID NO:63) starts at position 711 and ends at position 971. The transcript also has the following SNPs as listed in Table 114 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 114 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
G ->            545, 933
G -> C          545, 858
C ->            549
A -> G          718, 1120, 1791
A -> T          766
A -> C          766
T -> A          827, 1146
T -> G          827
C -> T          1012
G -> A          1121, 1792

The coding portion of transcript R31375_T13 (SEQ ID NO:64) starts at position 1015 and ends at position 1275. The transcript also has the following SNPs as listed in Table 115 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 115 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
G -> A          513, 1425, 2096
C -> A          773
G ->            849, 1237
G -> C          849, 1162
C ->            853
A -> G          1022, 1424, 2095
A -> T          1070
A -> C          1070
T -> A          1131, 1450
T -> G          1131
C -> T          1316

The coding portion of transcript R31375_T2 (SEQ ID NO:53) starts at position 678 and ends at position 938. The transcript also has the following SNPs as listed in Table 116 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 116 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
G -> A          513, 1088, 1759
C -> A          656
A -> G          685, 1087, 1758
A -> T          733
A -> C          733
T -> A          794, 1113
T -> G          794
G -> C          825
G ->            900
C -> T          979

The coding portion of transcript R31375_T3 (SEQ ID NO:54) starts at position 572 and ends at position 832. The transcript also has the following SNPs as listed in Table 117 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 117 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          550
A -> G          579, 981, 1652
A -> T          627
A -> C          627
T -> A          688, 1007
T -> G          688
G -> C          719
G ->            794
C -> T          873
G -> A          982, 1653

The coding portion of transcript R31375_T4 (SEQ ID NO:55) starts at position 575 and ends at position 835. The transcript also has the following SNPs as listed in Table 118 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 118 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          582, 984, 1655
A -> T          630
A -> C          630
T -> A          691, 1010
T -> G          691
G -> C          722
G ->            797
C -> T          876
G -> A          985, 1656

The coding portion of transcript R31375_T5 (SEQ ID NO:56) starts at position 656 and ends at position 916. The transcript also has the following SNPs as listed in Table 119 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 119 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          550
A -> G          663, 1065, 1736
A -> T          711
A -> C          711
T -> A          772, 1091
T -> G          772
G -> C          803
G ->            878
C -> T          957
G -> A          1066, 1737

The coding portion of transcript R31375_T6 (SEQ ID NO:57) starts at position 697 and ends at position 957. The transcript also has the following SNPs as listed in Table 120 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 120 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          704, 1106, 1777
A -> T          752
A -> C          752
T -> A          813, 1132
T -> G          813
G -> C          844
G ->            919
C -> T          998
G -> A          1107, 1778

The coding portion of transcript R31375_T7 (SEQ ID NO:58) starts at position 2475 and ends at position 2735. The transcript also has the following SNPs as listed in Table 121 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 121 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          639, 984, 1504, 2482, 2884, 3555
A -> C          860, 1183, 1447, 1504, 2530
C ->            1063
A -> T          1183, 1447, 2530
C -> T          1999, 2022, 2776
-> G            2279
-> A            2280
G -> T          2285
G -> C          2285, 2622
T -> A          2591, 2910
T -> G          2591
G ->            2697
G -> A          2885, 3556

The coding portion of transcript R31375_T8 (SEQ ID NO:59) starts at position 1329 and ends at position 1589. The transcript also has the following SNPs as listed in Table 122 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 122 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          639, 984, 1336, 1738, 2409
A -> C          860, 1183, 1384
C ->            1063
A -> T          1183, 1384
T -> A          1445, 1764
T -> G          1445
G -> C          1476
G ->            1551
C -> T          1630
G -> A          1739, 2410

The coding portion of transcript R31375_T9 (SEQ ID NO:60) starts at position 2586 and ends at position 2846. The transcript also has the following SNPs as listed in Table 123 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 123 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469, 2531, 2549
A -> G          639, 984, 1504, 2593, 2995, 3666
A -> C          860, 1183, 1447, 1504, 2641
C ->            1063
A -> T          1183, 1447, 2641
C -> T          1999, 2022, 2887
-> G            2279
-> A            2280
G -> T          2285
G -> C          2285, 2733
C -> G          2531, 2549
G -> A          2550, 2996, 3667
T -> A          2702, 3021
T -> G          2702
G ->            2808

Variant protein R31375_P14 (SEQ ID NO:72) according to the present invention has an amino acid sequence as encoded by transcripts R31375_T19 (SEQ ID NO:65), R31375_T25 (SEQ ID NO:66) and R31375_T26 (SEQ ID NO:67). Alignments to one or more previously published protein sequences are shown in FIG. 49A. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between R31375_P14 (SEQ ID NO:72) and known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70) (FIG. 49A):

A. An isolated chimeric polypeptide encoding for R31375_P14 (SEQ ID NO:72), comprising a first amino acid sequence being at least 90% homologous to MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYY corresponding to amino acids 1-32 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 1-32 of R31375_P14 (SEQ ID NO:72), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence GAPYIFVKRMGGQMKRTQAGTEVPSTFLL (SEQ ID NO: 294) corresponding to amino acids 33-61 of R31375_P14 (SEQ ID NO:72), and a third amino acid sequence being at least 90% homologous to DWHSLQVGGLICAGVLCAMGIIIVMSAKCKCKFGQKSGHHPGETPPLITPGSAQS corresponding to amino acids 33-87 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 62-116 of R31375_P14 (SEQ ID NO:72), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of R31375_P14 (SEQ ID NO:72), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence GAPYIFVKRMGGQMKRTQAGTEVPSTFLL (SEQ ID NO: 294) of R31375_P14 (SEQ ID NO:72).
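As a non-limiting worked illustration of the comparison report above, the three amino acid sequences recited for R31375_P14 (SEQ ID NO:72) can be joined in order and their coordinates checked against the stated ranges (1-32, 33-61 and 62-116). The helper name `assemble` below is hypothetical and is introduced only for this sketch; the sequences themselves are copied from the comparison report.

```python
# Segments recited in the comparison report for R31375_P14 (SEQ ID NO:72).
FIRST = "MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYY"  # amino acids 1-32 of SEQ ID NO:70
INSERT = "GAPYIFVKRMGGQMKRTQAGTEVPSTFLL"    # SEQ ID NO:294, unique to the variant
THIRD = "DWHSLQVGGLICAGVLCAMGIIIVMSAKCKCKFGQKSGHHPGETPPLITPGSAQS"  # aa 33-87


def assemble(*segments):
    """Join segments and return (sequence, 1-based inclusive coordinates).

    Hypothetical helper for illustration only.
    """
    coords = []
    start = 1
    for seg in segments:
        end = start + len(seg) - 1
        coords.append((start, end))
        start = end + 1
    return "".join(segments), coords


p14, coords = assemble(FIRST, INSERT, THIRD)
wild_type = FIRST + THIRD  # known protein: the same sequence without the insert

print(len(wild_type), len(p14), coords)
```

The joined sequence is 116 amino acids long against 87 for the known protein, and the computed coordinates agree with the ranges recited in item A.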

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein R31375_P14 (SEQ ID NO:72) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 124, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:72)).

TABLE 124 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
3                                       K -> R
19                                      D -> A
19                                      D -> V
46                                      M -> I
47                                      K -> E
79                                      A -> P
104                                     E ->

The coding portion of transcript R31375_T19 (SEQ ID NO:65) starts at position 491 and ends at position 838. The transcript also has the following SNPs as listed in Table 125 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 125 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          498, 629, 987, 1658
A -> T          546
A -> C          546
G -> A          628, 988, 1659
T -> A          694, 1013
T -> G          694
G -> C          725
G ->            800
C -> T          879

The coding portion of transcript R31375_T25 (SEQ ID NO:66) starts at position 575 and ends at position 922. The transcript also has the following SNPs as listed in Table 126 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 126 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          582, 713, 1071, 1742
A -> T          630
A -> C          630
G -> A          712, 1072, 1743
T -> A          778, 1097
T -> G          778
G -> C          809
G ->            884
C -> T          963

The coding portion of transcript R31375_T26 (SEQ ID NO:67) starts at position 1443 and ends at position 1790. The transcript also has the following SNPs as listed in Table 127 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 127 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          639, 984, 1450, 1581, 1939, 2610
A -> C          860, 1183, 1498
C ->            1063
A -> T          1183, 1498
G -> A          1580, 1940, 2611
T -> A          1646, 1965
T -> G          1646
G -> C          1677
G ->            1752
C -> T          1831

Variant protein R31375_P31 (SEQ ID NO:73) according to the present invention has an amino acid sequence as encoded by transcript R31375_T29 (SEQ ID NO:68). Alignments to one or more previously published protein sequences are given in FIGS. 49B and 49C. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between R31375_P31 (SEQ ID NO:73) and known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70) (FIG. 49B):

A. An isolated chimeric polypeptide encoding for R31375_P31 (SEQ ID NO:73), comprising a first amino acid sequence being at least 90% homologous to MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYDWHSLQVGGLICAGVLCAMGIIIVMS corresponding to amino acids 1-58 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 1-58 of R31375_P31 (SEQ ID NO:73), a second amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence EWRSSGEQAGRGWGSPPLTTQLSPTG (SEQ ID NO: 295) corresponding to amino acids 59-84 of R31375_P31 (SEQ ID NO:73), and a third amino acid sequence being at least 90% homologous to AKCKCKFGQKSG corresponding to amino acids 59-70 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 85-96 of R31375_P31 (SEQ ID NO:73), wherein said first amino acid sequence, second amino acid sequence and third amino acid sequence are contiguous and in a sequential order.

B. An isolated polypeptide encoding for an edge portion of R31375_P31 (SEQ ID NO:73), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence EWRSSGEQAGRGWGSPPLTTQLSPTG (SEQ ID NO: 295) of R31375_P31 (SEQ ID NO:73).

2. Comparison report between R31375_P31 (SEQ ID NO:73) and known proteins NP_068710 (SEQ ID NO: 71) (FIG. 49C):

A. An isolated chimeric polypeptide encoding for R31375_P31 (SEQ ID NO:73), comprising an amino acid sequence being at least 90% homologous to MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYDWHSLQVGGLICAGVLCAMGIIIVMSEWRSSGEQAGRGWGSPPLTTQLSPTGAKCKCKFGQKSG corresponding to amino acids 1-96 of known proteins NP_068710 (SEQ ID NO: 71), which also corresponds to amino acids 1-96 of R31375_P31 (SEQ ID NO:73), wherein said amino acid sequence is contiguous and in a sequential order.

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: membrane.

Variant protein R31375_P31 (SEQ ID NO:73) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 128, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:73)).

TABLE 128 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
3                                       K -> R
19                                      D -> A
19                                      D -> V
50                                      A -> P

Variant protein R31375_P31 (SEQ ID NO:73) is encoded by the transcript R31375_T29 (SEQ ID NO:68), for which the coding portion starts at position 491 and ends at position 778. The transcript also has the following SNPs as listed in Table 129 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 129 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          498, 1294, 1965
A -> T          546
A -> C          546
T -> A          607, 1320
T -> G          607
G -> C          638
G ->            1107
C -> T          1186
G -> A          1295, 1966

Variant protein R31375_P33 (SEQ ID NO:74) according to the present invention has an amino acid sequence as encoded by transcript R31375_T39 (SEQ ID NO:69). Alignments to one or more previously published protein sequences are given in FIGS. 49D and 49E. A brief description of the relationship of the variant protein according to the present invention to each such aligned protein is as follows:

1. Comparison report between R31375_P33 (SEQ ID NO:74) and known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70) (FIG. 49D):

A. An isolated chimeric polypeptide encoding for R31375_P33 (SEQ ID NO:74), comprising a first amino acid sequence being at least 90% homologous to MQKVTLGLLVFLAGFPVLDANDLE corresponding to amino acids 1-24 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 1-24 of R31375_P33 (SEQ ID NO:74), a second amino acid sequence being at least 90% homologous to DWHSLQVGGLICAGVLCAMGIIIVMS corresponding to amino acids 33-58 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 25-50 of R31375_P33 (SEQ ID NO:74), a third amino acid sequence being at least 70%, optionally at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95, 96, 97, 98 or 99% homologous to a polypeptide having the sequence EWRSSGEQAGRGWGSPPLTTQLSPTG (SEQ ID NO: 295) corresponding to amino acids 51-76 of R31375_P33 (SEQ ID NO:74), and a fourth amino acid sequence being at least 90% homologous to AKCKCKFGQKSG corresponding to amino acids 59-70 of known proteins FXYD3_HUMAN, NP_005962 and Q6IB59_HUMAN (SEQ ID NO: 70), which also corresponds to amino acids 77-88 of R31375_P33 (SEQ ID NO:74), wherein said first amino acid sequence, second amino acid sequence, third amino acid sequence and fourth amino acid sequence are contiguous and in a sequential order.

B. An isolated chimeric polypeptide encoding for an edge portion of R31375_P33 (SEQ ID NO:74), comprising a polypeptide having a length “n”, wherein n is at least about 10 amino acids in length, optionally at least about 20 amino acids in length, preferably at least about 30 amino acids in length, more preferably at least about 40 amino acids in length and most preferably at least about 50 amino acids in length, wherein at least two amino acids comprise ED, having a structure as follows: a sequence starting from any of amino acid numbers 24−x to 24; and ending at any of amino acid numbers 25+((n−2)−x), in which x varies from 0 to n−2.

C. An isolated polypeptide encoding for an edge portion of R31375_P33 (SEQ ID NO:74), comprising an amino acid sequence being at least 70%, optionally at least about 80%, preferably at least about 85%, more preferably at least about 90% and most preferably at least about 95, 96, 97, 98 or 99% homologous to the sequence EWRSSGEQAGRGWGSPPLTTQLSPTG (SEQ ID NO: 295) of R31375_P33 (SEQ ID NO:74).

2. Comparison report between R31375_P33 (SEQ ID NO:74) and known proteins NP_068710 (SEQ ID NO: 71) (FIG. 49E):

A. An isolated chimeric polypeptide encoding for R31375_P33 (SEQ ID NO:74), comprising a first amino acid sequence being at least 90% homologous to MQKVTLGLLVFLAGFPVLDANDLE corresponding to amino acids 1-24 of known proteins NP_068710 (SEQ ID NO: 71), which also corresponds to amino acids 1-24 of R31375_P33 (SEQ ID NO:74), and a second amino acid sequence being at least 90% homologous to DWHSLQVGGLICAGVLCAMGIIIVMSEWRSSGEQAGRGWGSPPLTTQLSPTGAKCKCKFGQKSG corresponding to amino acids 33-96 of known proteins NP_068710 (SEQ ID NO: 71), which also corresponds to amino acids 25-88 of R31375_P33 (SEQ ID NO:74), wherein said first amino acid sequence and second amino acid sequence are contiguous and in a sequential order.

B. An isolated chimeric polypeptide encoding for an edge portion of R31375_P33 (SEQ ID NO:74), comprising a polypeptide having a length “n”, wherein n is at least about 10 amino acids in length, optionally at least about 20 amino acids in length, preferably at least about 30 amino acids in length, more preferably at least about 40 amino acids in length and most preferably at least about 50 amino acids in length, wherein at least two amino acids comprise ED, having a structure as follows: a sequence starting from any of amino acid numbers 24−x to 24; and ending at any of amino acid numbers 25+((n−2)−x), in which x varies from 0 to n−2.
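The edge portion formula recited above (a sequence of length n starting at any of amino acid numbers 24−x to 24 and ending at amino acid number 25+((n−2)−x), with x varying from 0 to n−2) can be illustrated, in a non-limiting manner, by enumerating the resulting peptides. In the sketch below the R31375_P33 sequence is assembled from the four segments recited in the comparison report; the function name `edge_windows`, the `junction` parameter and the bounds check that discards windows running past either end of the protein are assumptions made for this illustration only.

```python
# R31375_P33 (SEQ ID NO:74), joined from the segments recited in the comparison report.
P33 = (
    "MQKVTLGLLVFLAGFPVLDANDLE"       # amino acids 1-24
    "DWHSLQVGGLICAGVLCAMGIIIVMS"     # amino acids 25-50
    "EWRSSGEQAGRGWGSPPLTTQLSPTG"     # amino acids 51-76 (SEQ ID NO:295)
    "AKCKCKFGQKSG"                   # amino acids 77-88
)


def edge_windows(seq, n, junction=24):
    """Yield (start, end, peptide) for length-n windows spanning the junction.

    Coordinates are 1-based and inclusive. The window starts at junction - x
    and ends at junction + 1 + (n - 2) - x, for x from 0 to n - 2.
    Hypothetical helper; windows that fall outside the protein are skipped.
    """
    for x in range(0, n - 1):
        start = junction - x
        end = junction + 1 + (n - 2) - x
        if start < 1 or end > len(seq):
            continue
        yield start, end, seq[start - 1:end]


windows = list(edge_windows(P33, 10))
for start, end, peptide in windows:
    print(start, end, peptide)
```

For n = 10 this yields nine peptides, from amino acids 24-33 (EDWHSLQVGG) to amino acids 16-25 (PVLDANDLED), each containing the ED pair formed by amino acids 24 and 25.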

The localization of the variant protein was determined according to results from a number of different software programs and analyses, including analyses from SignalP and other specialized programs. The variant protein is believed to be located as follows with regard to the cell: secreted.

Variant protein R31375_P33 (SEQ ID NO:74) also has the following non-silent SNPs (Single Nucleotide Polymorphisms) as listed in Table 130, (given according to their positions on the amino acid sequence, with the alternative amino acids listed (SEQ ID NO:74)).

TABLE 130 Amino acid mutations
SNP positions on amino acid sequence    Alternative amino acids
3                                       K -> R
19                                      D -> A
19                                      D -> V
42                                      A -> P

The coding portion of transcript R31375_T39 (SEQ ID NO:69) starts at position 491 and ends at position 754. The transcript also has the following SNPs as listed in Table 131 (given according to their position on the nucleotide sequence, with the alternative nucleic acid listed).

TABLE 131 Nucleic acid SNPs
Polymorphism    SNP positions on nucleotide sequence
C -> A          469
A -> G          498, 1270, 1941
A -> T          546
A -> C          546
T -> A          583, 1296
T -> G          583
G -> C          614
G ->            1083
C -> T          1162
G -> A          1271, 1942
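As a non-limiting consistency check, the coding portion coordinates stated above for one transcript of each variant protein can be converted into a number of codons and compared with the protein lengths recited in the comparison reports (87, 116, 96 and 88 amino acids). The sketch assumes that the stated end position excludes the stop codon; this is an assumption, although it is the reading under which the numbers agree. The helper name `codon_count` is hypothetical.

```python
# Coding portion (start, end), 1-based inclusive, copied from the text above.
CODING = {
    "R31375_T0 (R31375_P0)": (491, 751),
    "R31375_T19 (R31375_P14)": (491, 838),
    "R31375_T29 (R31375_P31)": (491, 778),
    "R31375_T39 (R31375_P33)": (491, 754),
}


def codon_count(start, end):
    """Number of whole codons in an inclusive nucleotide range.

    Hypothetical helper; assumes the range excludes the stop codon.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("coding portion is not a whole number of codons")
    return length // 3


protein_lengths = {name: codon_count(*span) for name, span in CODING.items()}
for name, n_codons in protein_lengths.items():
    print(name, n_codons)
```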

According to an optional embodiment of the present invention, short segments related to the above cluster are also provided. These segments are up to about 120 bp in length, and so are included in a separate description.

Segment cluster R31375_N30 (SEQ ID NO:134) according to the present invention is supported by 7 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: R31375_T19 (SEQ ID NO:65), R31375_T25 (SEQ ID NO:66) and R31375_T26 (SEQ ID NO:67). Table 132 below describes the starting and ending position of this segment on each transcript.

TABLE 132 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
R31375_T19 (SEQ ID NO: 65)    588                          674
R31375_T25 (SEQ ID NO: 66)    672                          758
R31375_T26 (SEQ ID NO: 67)    1540                         1626
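By way of non-limiting illustration, the length of segment R31375_N30 (SEQ ID NO:134) can be derived from each row of Table 132 as the ending position minus the starting position plus one, and should be the same on every transcript. The variable names below are hypothetical.

```python
# (start, end) of segment R31375_N30 on each transcript, copied from Table 132.
TABLE_132 = {
    "R31375_T19": (588, 674),
    "R31375_T25": (672, 758),
    "R31375_T26": (1540, 1626),
}

# Inclusive coordinates, so the length is end - start + 1.
lengths = {name: end - start + 1 for name, (start, end) in TABLE_132.items()}
consistent = len(set(lengths.values())) == 1

print(lengths, consistent)
```

Each row gives 87 nucleotides, which is within the stated limit of about 120 bp. It is also three times 29, which is at least consistent with this segment contributing the 29 amino acid insertion of R31375_P14 (SEQ ID NO:72), although that correspondence is an inference and is not stated in the table.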

Segment cluster R31375_N33 (SEQ ID NO:135) according to the present invention is supported by 278 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: R31375_T0 (SEQ ID NO:51), R31375_T1 (SEQ ID NO:52), R31375_T10 (SEQ ID NO:61), R31375_T11 (SEQ ID NO:62), R31375_T12 (SEQ ID NO:63), R31375_T13 (SEQ ID NO:64), R31375_T19 (SEQ ID NO:65), R31375_T2 (SEQ ID NO:53), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T3 (SEQ ID NO:54), R31375_T39 (SEQ ID NO:69), R31375_T4 (SEQ ID NO:55), R31375_T5 (SEQ ID NO:56), R31375_T6 (SEQ ID NO:57), R31375_T7 (SEQ ID NO:58), R31375_T8 (SEQ ID NO:59) and R31375_T9 (SEQ ID NO:60). Table 133 below describes the starting and ending position of this segment on each transcript.

TABLE 133 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
R31375_T0 (SEQ ID NO: 51)     588                          631
R31375_T1 (SEQ ID NO: 52)     892                          935
R31375_T10 (SEQ ID NO: 61)    1923                         1966
R31375_T11 (SEQ ID NO: 62)    710                          753
R31375_T12 (SEQ ID NO: 63)    808                          851
R31375_T13 (SEQ ID NO: 64)    1112                         1155
R31375_T19 (SEQ ID NO: 65)    675                          718
R31375_T2 (SEQ ID NO: 53)     775                          818
R31375_T25 (SEQ ID NO: 66)    759                          802
R31375_T26 (SEQ ID NO: 67)    1627                         1670
R31375_T29 (SEQ ID NO: 68)    588                          631
R31375_T3 (SEQ ID NO: 54)     669                          712
R31375_T39 (SEQ ID NO: 69)    564                          607
R31375_T4 (SEQ ID NO: 55)     672                          715
R31375_T5 (SEQ ID NO: 56)     753                          796
R31375_T6 (SEQ ID NO: 57)     794                          837
R31375_T7 (SEQ ID NO: 58)     2572                         2615
R31375_T8 (SEQ ID NO: 59)     1426                         1469
R31375_T9 (SEQ ID NO: 60)     2683                         2726

Segment cluster R31375_N34 (SEQ ID NO:136) according to the present invention is supported by 275 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: R31375_T0 (SEQ ID NO:51), R31375_T1 (SEQ ID NO:52), R31375_T10 (SEQ ID NO:61), R31375_T11 (SEQ ID NO:62), R31375_T12 (SEQ ID NO:63), R31375_T13 (SEQ ID NO:64), R31375_T19 (SEQ ID NO:65), R31375_T2 (SEQ ID NO:53), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T3 (SEQ ID NO:54), R31375_T39 (SEQ ID NO:69), R31375_T4 (SEQ ID NO:55), R31375_T5 (SEQ ID NO:56), R31375_T6 (SEQ ID NO:57), R31375_T7 (SEQ ID NO:58), R31375_T8 (SEQ ID NO:59) and R31375_T9 (SEQ ID NO:60). Table 134 below describes the starting and ending position of this segment on each transcript.

TABLE 134 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
R31375_T0 (SEQ ID NO: 51)     632                          662
R31375_T1 (SEQ ID NO: 52)     936                          966
R31375_T10 (SEQ ID NO: 61)    1967                         1997
R31375_T11 (SEQ ID NO: 62)    754                          784
R31375_T12 (SEQ ID NO: 63)    852                          882
R31375_T13 (SEQ ID NO: 64)    1156                         1186
R31375_T19 (SEQ ID NO: 65)    719                          749
R31375_T2 (SEQ ID NO: 53)     819                          849
R31375_T25 (SEQ ID NO: 66)    803                          833
R31375_T26 (SEQ ID NO: 67)    1671                         1701
R31375_T29 (SEQ ID NO: 68)    632                          662
R31375_T3 (SEQ ID NO: 54)     713                          743
R31375_T39 (SEQ ID NO: 69)    608                          638
R31375_T4 (SEQ ID NO: 55)     716                          746
R31375_T5 (SEQ ID NO: 56)     797                          827
R31375_T6 (SEQ ID NO: 57)     838                          868
R31375_T7 (SEQ ID NO: 58)     2616                         2646
R31375_T8 (SEQ ID NO: 59)     1470                         1500
R31375_T9 (SEQ ID NO: 60)     2727                         2757

Segment cluster R31375_N37 (SEQ ID NO:137) according to the present invention is supported by 254 libraries. The number of libraries was determined as previously described. This segment can be found in the following transcripts: R31375_T0 (SEQ ID NO:51), R31375_T1 (SEQ ID NO:52), R31375_T10 (SEQ ID NO:61), R31375_T11 (SEQ ID NO:62), R31375_T12 (SEQ ID NO:63), R31375_T13 (SEQ ID NO:64), R31375_T19 (SEQ ID NO:65), R31375_T2 (SEQ ID NO:53), R31375_T25 (SEQ ID NO:66), R31375_T26 (SEQ ID NO:67), R31375_T29 (SEQ ID NO:68), R31375_T3 (SEQ ID NO:54), R31375_T39 (SEQ ID NO:69), R31375_T4 (SEQ ID NO:55), R31375_T5 (SEQ ID NO:56), R31375_T6 (SEQ ID NO:57), R31375_T7 (SEQ ID NO:58), R31375_T8 (SEQ ID NO:59) and R31375_T9 (SEQ ID NO:60). Table 135 below describes the starting and ending position of this segment on each transcript.

TABLE 135 Segment location on transcripts
Transcript name               Segment starting position    Segment ending position
R31375_T0 (SEQ ID NO: 51)     663                          699
R31375_T1 (SEQ ID NO: 52)     967                          1003
R31375_T10 (SEQ ID NO: 61)    1998                         2034
R31375_T11 (SEQ ID NO: 62)    785                          821
R31375_T12 (SEQ ID NO: 63)    883                          919
R31375_T13 (SEQ ID NO: 64)    1187                         1223
R31375_T19 (SEQ ID NO: 65)    750                          786
R31375_T2 (SEQ ID NO: 53)     850                          886
R31375_T25 (SEQ ID NO: 66)    834                          870
R31375_T26 (SEQ ID NO: 67)    1702                         1738
R31375_T29 (SEQ ID NO: 68)    741                          777
R31375_T3 (SEQ ID NO: 54)     744                          780
R31375_T39 (SEQ ID NO: 69)    717                          753
R31375_T4 (SEQ ID NO: 55)     747                          783
R31375_T5 (SEQ ID NO: 56)     828                          864
R31375_T6 (SEQ ID NO: 57)     869                          905
R31375_T7 (SEQ ID NO: 58)     2647                         2683
R31375_T8 (SEQ ID NO: 59)     1501                         1537
R31375_T9 (SEQ ID NO: 60)     2758                         2794

Expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc30-33 (SEQ ID NO: 247) in normal and cancerous Ovary tissues and in different normal tissues

Expression of FXYD3 domain containing ion transport regulator 3 transcripts detectable by or according to junc30-33-R31375_junc30-33 (SEQ ID NO: 247) amplicon and primers R31375_junc30-33F1 (SEQ ID NO: 245) and R31375_junc30-33R1 (SEQ ID NO: 246) was measured by real time PCR on the ovary panel and the normal panel. The samples used are detailed in Table 4 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

Ovary panel: non-detected samples (samples no. 33 and 53, Table 4) were assigned a Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to the median of the normal samples.
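The division by the median of the normal samples described above can be sketched, in a non-limiting manner, as follows. The sample labels and normalized quantities are purely hypothetical placeholders and are not measurements from Table 4; the conversion from Ct values to normalized quantities is described in Example 1 and is not reproduced here. The function name `fold_upregulation` is likewise an assumption of this sketch.

```python
from statistics import median


def fold_upregulation(quantities, normal_ids):
    """Divide every normalized quantity by the median of the normal samples.

    quantities: mapping of sample label to normalized quantity.
    normal_ids: labels of the samples that form the normal reference group.
    Hypothetical helper for illustration only.
    """
    baseline = median(quantities[s] for s in normal_ids)
    return {sample: q / baseline for sample, q in quantities.items()}


# Hypothetical normalized quantities (NOT data from the ovary panel).
quantities = {"T1": 40.0, "T2": 9.0, "N1": 1.0, "N2": 2.0, "N3": 4.0}
folds = fold_upregulation(quantities, ["N1", "N2", "N3"])

# Samples at or above an 18 fold threshold, as used in the analysis of FIG. 50.
above = [s for s, f in folds.items() if f >= 18]
print(folds, above)
```

With these placeholder values the median of the normal samples is 2.0, so sample T1 is up-regulated 20 fold and is the only sample at or above the 18 fold threshold.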

FIG. 50 is a histogram showing over expression of the above-indicated FXYD3 domain containing ion transport regulator 3 transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 50, the expression of FXYD3 domain containing ion transport regulator 3 transcripts detectable by the above amplicon in adenocarcinoma samples, specifically mucinous carcinoma and endometrioid samples, was significantly higher than in the non-cancerous samples (sample numbers 52-78, Table 4 above). Notably, an over-expression of at least 18 fold was found in 13 out of 37 adenocarcinoma samples, specifically in 7 out of 9 mucinous carcinoma samples and in 4 out of 10 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of FXYD3 domain containing ion transport regulator 3 transcripts detectable by the above amplicon in Ovary adenocarcinoma samples, mucinous carcinoma samples and endometrioid samples versus the normal tissue samples was determined by T test as 9.75e-004, 1.92e-002 and 1.55e-002, respectively.

A threshold of 18 fold over expression was found to differentiate between adenocarcinoma, mucinous carcinoma and endometrioid samples and normal samples with P values of 2.71e-004, 4.31e-006 and 3.18e-003, respectively, as checked by exact Fisher test.
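A threshold comparison of this kind reduces to a 2x2 table (cancer or normal, at or above the threshold or below it). The sketch below computes a one-sided Fisher exact P value from such a table using only the standard library. Whether the reported values are one-sided or two-sided is not stated, so the one-sided form is an assumption. The cancer counts (13 of 37) and the number of normal samples (27, from sample numbers 52-78) come from the text; the number of normal samples above the threshold is a hypothetical value, so the result is not expected to reproduce the reported P value.

```python
from math import comb


def fisher_exact_one_sided(a, b, c, d):
    """One-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]].

    Returns the probability, under fixed margins, of seeing at least `a`
    counts in the top-left cell (hypergeometric upper tail).
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1))
    return tail / total


# Rows: cancer, normal. Columns: at or above threshold, below threshold.
cancer_above, cancer_total = 13, 37  # from the text
normal_above, normal_total = 0, 27   # normal_above is a hypothetical value
p_value = fisher_exact_one_sided(
    cancer_above,
    cancer_total - cancer_above,
    normal_above,
    normal_total - normal_above,
)
print(f"one-sided P = {p_value:.3g}")
```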

The above values demonstrate statistical significance of the results.

Normal panel: non-detected samples (samples no. 50 and 54, Table 2) were assigned a Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31-34, Table 2 above), to obtain a value of relative expression of each sample relative to the median of the ovary samples, as shown in FIG. 51.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: R31375_junc30-33F1 (SEQ ID NO: 245) forward primer; and R31375_junc30-33R1 (SEQ ID NO: 246) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: R31375_junc30-33 (SEQ ID NO: 247).

Forward Primer >R31375_junc30-33F1 (SEQ ID NO: 245):
GTGCTCCATATATATTTGTCAAGAGAATG

Reverse Primer >R31375_junc30-33R1 (SEQ ID NO: 246):
GGAGGCTGTGCCAGTCTAGG

Amplicon >R31375_junc30-33 (SEQ ID NO: 247):
GTGCTCCATATATATTTGTCAAGAGAATGGGGGGACAGATGAAGAGGACACAGGCTGGCACTGAGGTCCCCTCCACTTTCCTCCTAGACTGGCACAGCCTCC
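As a consistency check on the sequences listed above, an amplicon should begin with the forward primer and end with the reverse complement of the reverse primer. The following minimal sketch tests this for the junc30-33 primer pair; the sequences are copied from the listing with internal spaces removed, and the function names are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primers_flank_amplicon(forward, reverse, amplicon):
    """True if the amplicon starts with the forward primer and ends with
    the reverse complement of the reverse primer."""
    return amplicon.startswith(forward) and amplicon.endswith(
        reverse_complement(reverse)
    )


FORWARD = "GTGCTCCATATATATTTGTCAAGAGAATG"  # SEQ ID NO: 245
REVERSE = "GGAGGCTGTGCCAGTCTAGG"           # SEQ ID NO: 246
AMPLICON = (                               # SEQ ID NO: 247
    "GTGCTCCATATATATTTGTCAAGAGAATGGGGGGACAGATGAAGAGGACA"
    "CAGGCTGGCACTGAGGTCCCCTCCACTTTCCTCCTAGACTGGCACAGCCT"
    "CC"
)

print(primers_flank_amplicon(FORWARD, REVERSE, AMPLICON))
print(len(AMPLICON))
```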

Expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_seg33junc34-37 (SEQ ID NO: 250) in normal and cancerous Ovary tissues and in different normal tissues

Expression of FXYD3 domain containing ion transport regulator 3 transcripts detectable by or according to seg33junc34-37-R31375_seg33junc34-37 (SEQ ID NO: 250) amplicon and primers R31375_seg33junc34-37F1 (SEQ ID NO: 248) and R31375_seg33junc34-37R1 (SEQ ID NO: 249) was measured by real time PCR on the ovary panel and the normal panel. The samples used are detailed in Table 4 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

Ovary panel: non-detected samples (samples no. 52, 61 and 70, Table 4) were assigned a Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to the median of the normal samples.

FIG. 52 is a histogram showing over expression of the above-indicated FXYD3 domain containing ion transport regulator 3 transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 52, the expression of FXYD3 domain containing ion transport regulator 3 transcripts detectable by the above amplicon in adenocarcinoma samples (serous carcinoma, mucinous carcinoma and endometrioid) was significantly higher than in the non-cancerous samples (sample numbers 52-78, Table 4 above). Notably, an over-expression of at least 85 fold was found in 20 out of 37 adenocarcinoma samples, specifically in 7 out of 18 serous carcinoma samples, in 8 out of 9 mucinous carcinoma samples and in 5 out of 10 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of FXYD3 domain containing ion transport regulator 3 transcripts detectable by the above amplicon in Ovary adenocarcinoma samples, serous carcinoma samples, mucinous carcinoma samples and endometrioid samples versus the normal tissue samples was determined by T test as 1.61e-004, 5.40e-003, 1.49e-002 and 9.08e-003, respectively.

A threshold of 85 fold over expression was found to differentiate between adenocarcinoma, serous carcinoma, mucinous carcinoma and endometrioid samples and normal samples with P values of 8.11e-007, 7.01e-004, 2.97e-007 and 5.78e-004, respectively, as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

Normal panel: the normalized quantity of each RT sample was divided by the median of the quantities of the ovary samples (sample numbers 31-34, Table 2 above), to obtain a value of relative expression of each sample relative to the median of the ovary samples, as shown in FIG. 53.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: R31375_seg33junc34-37F1 (SEQ ID NO: 248) forward primer; and R31375_seg33junc34-37R1 (SEQ ID NO: 249) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: R31375_seg33junc34-37 (SEQ ID NO: 250).

Forward Primer >R31375_seg33junc34-37F1 (SEQ ID NO: 248):
ACTGGCACAGCCTCCAGG

Reverse Primer >R31375_seg33junc34-37R1 (SEQ ID NO: 249):
CATTTGCATTTTGCACTCATG

Amplicon >R31375_seg33junc34-37 (SEQ ID NO: 250):
ACTGGCACAGCCTCCAGGTTGGCGGGCTCATCTGCGCTGGGGTTCTGTGCGCCATGGGCATCATCATCGTCATGAGTGCAAAATGCAAATG

Expression of FXYD3 domain containing ion transport regulator 3 R31375 transcripts which are detectable by amplicon as depicted in sequence name R31375_junc20-22seg30F6R6 (SEQ ID NO: 253) in normal and cancerous ovary tissues and in different normal tissues.

Expression of FXYD3 domain containing ion transport regulator 3 transcripts detectable by or according to junc20-22seg30-R31375_junc20-22seg30F6R6 (SEQ ID NO: 253) amplicon and primers R31375_junc20-22seg30F6 (SEQ ID NO: 251) and R31375_junc20-22seg30R6 (SEQ ID NO: 252) was measured by real time PCR on the ovary panel and the normal panel. The samples used are detailed in Table 4 and Table 2 above, respectively. For each RT sample, the expression of the above amplicon was normalized to the normalization factor calculated from the expression of several housekeeping genes as described in Example 1.

For the ovary panel: non-detected samples (samples no. 2, 6, 9, 12, 15, 19, 21, 24, 32, 34, 38, 45, 53, 56-59, 62, 63, 65-67 and 72-78, Table 4) were assigned a Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the normal samples (sample numbers 52, 53, 56-59, 62, 63, 65-67 and 72-78, Table 4 above), to obtain a value of fold up-regulation for each sample relative to the median of the normal samples.

FIG. 54 is a histogram showing over expression of the above-indicated FXYD3 transcripts in cancerous Ovary samples relative to the normal samples.

As is evident from FIG. 54, the expression of FXYD3 transcripts detectable by the above amplicon in adenocarcinoma samples, serous carcinoma samples, mucinous carcinoma samples and endometrioid samples was significantly higher than in the non-cancerous samples (sample numbers 52, 53, 56-59, 62, 63, 65-67 and 72-78, Table 4 above). Notably, an over-expression of at least 14 fold was found in 21 out of 33 adenocarcinoma samples, specifically in 10 out of 16 serous carcinoma samples, in 5 out of 8 mucinous carcinoma samples and in 6 out of 9 endometrioid samples.

Statistical analysis was applied to verify the significance of these results, as described below. The P value for the difference in the expression levels of FXYD3 transcripts detectable by the above amplicon in ovary adenocarcinoma samples versus the normal tissue samples was determined by T test as 3.78e-003.

A threshold of 14 fold over expression was found to differentiate between adenocarcinoma, serous carcinoma, mucinous carcinoma and endometrioid samples and normal samples with P values of 4.21e-005, 5.17e-004, 4.46e-003 and 1.73e-003, respectively, as checked by exact Fisher test.

The above values demonstrate statistical significance of the results.

For the normal panel: non-detected samples (samples no. 1, 10, 11, 14, 17, 25, 29, 31-34, 38, 39, 46, 47, 49-54, 57, 58, 61-65, 68, 69 and 73, Table 2) were assigned a Ct value of 41 and were calculated accordingly. The normalized quantity of each RT sample was then divided by the median of the quantities of the ovary samples (sample numbers 31-34, Table 2 above), to obtain a value of relative expression of each sample relative to the median of the ovary samples, as shown in FIG. 55.

Primer pairs are also optionally and preferably encompassed within the present invention; for example, for the above experiment, the following primer pair was used as a non-limiting illustrative example only of a suitable primer pair: R31375_junc20-22seg30F6 (SEQ ID NO: 251) forward primer; and R31375_junc20-22seg30R6 (SEQ ID NO: 252) reverse primer.

The present invention also preferably encompasses any amplicon obtained through the use of any suitable primer pair; for example, for the above experiment, the following amplicon was obtained as a non-limiting illustrative example only of a suitable amplicon: R31375_junc20-22seg30F6R6 (SEQ ID NO: 253).

Forward Primer >R31375_junc20-22seg30F6 (SEQ ID NO: 251):
TTGTGTTCCTGGCAGGCTTT

Reverse Primer >R31375_junc20-22seg30R6 (SEQ ID NO: 252):
TCATCTGTCCCCCCATTCTC

Amplicon >R31375_junc20-22seg30F6R6 (SEQ ID NO: 253):
TTGTGTTCCTGGCAGGCTTTCCTGTCCTGGACGCCAATGACCTAGAAGATAAAAACAGTCCTTTCTACTATGGTGCTCCATATATATTTGTCAAGAGAATGGGGGGACAGATGA

Example 8 Cloning of Full Length Transcripts Encoding VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 Fused to EGFP

Cloning of Full Length transcripts encoding VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 fused to EGFP was done as described below.

First, an EGFP expression vector was constructed and then the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 open reading frames (ORFs) were cloned. EGFP was subcloned into pIRESpuro3 (Clontech catalog number: 631619) as follows: EGFP-N1 vector (Clontech catalog number: 6085-1) was digested with NheI and NotI to excise the EGFP gene. The EGFP insert was then ligated into pIRESpuro3 (Clontech catalog number: 631619), which was previously digested with the same enzymes, in order to obtain the EGFP-pIRESpuro3 vector.

Cloning of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32, FXYD3 open reading frames (ORFs) was done using the following steps:

1. A reverse transcription reaction was carried out as follows: 10 μg of purified RNA was mixed with 150 ng Random Hexamer primers (Invitrogen, Carlsbad, Calif., USA, catalog number: 48190-011) and 500 μM dNTPs in a total volume of 156 μl. The mixture was incubated for 5 min at 65° C. and then quickly chilled on ice. Thereafter, 50 μl of 5× SuperscriptII first strand buffer (Invitrogen, catalog number: 18064-014, part number: Y00146), 24 μl 0.1M DTT and 400 units RNasin (Promega, Milwaukee, Wis., U.S.A., catalog number: N2511) were added, and the mixture was incubated for 10 min at 25° C., followed by further incubation at 42° C. for 2 min. Then, 10 μl (2000 units) of SuperscriptII (Invitrogen, catalog number: 18064-014) was added and the reaction (final volume of 250 μl) was incubated for 50 min at 42° C. and then inactivated at 70° C. for 15 min. The resulting cDNA was diluted 1:20 in TE buffer (10 mM Tris, 1 mM EDTA pH 8).

2. PCR was done using Platinum PFX™ (Invitrogen, Carlsbad, Calif., USA, catalog number: 1178-021) under the following conditions: 5 μl Platinum PFX 10× buffer; 5 μl cDNA from the above; 2 μl 10 mM dNTPs (2.5 mM of each nucleotide); 0.5 μl Platinum PFX enzyme; 37 μl H2O; and 1.5 μl of each primer (15 μM) in a total reaction volume of 50 μl; with a reaction program of 5 minutes at 95° C.; 35 cycles of: 30 seconds at 94° C., 30 seconds at 55° C., 50 seconds at 68° C.; then 10 minutes at 68° C. The primers used include gene specific sequences corresponding to the desired coordinates of the protein, restriction enzyme sites and a Kozak sequence, as listed in Table 136 below. Bold letters in Table 136 represent the specific gene sequence, while the restriction site extensions utilized for cloning purposes are in italic and Kozak sequences are underlined.
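The final concentrations implied by the PCR conditions above follow from the dilution rule C1·V1 = C2·V2. The sketch below works them out against the stated 50 μl reaction volume and also sums the programmed hold times of the cycling program. It is a worked illustration only; the function and variable names are hypothetical, and ramp times between temperatures are ignored.

```python
def final_concentration(stock_conc, stock_volume_ul, total_volume_ul):
    """Dilution rule C1 * V1 = C2 * V2, solved for C2 (same unit as stock_conc)."""
    return stock_conc * stock_volume_ul / total_volume_ul


TOTAL_UL = 50.0  # reaction volume stated in the text

primer_um = final_concentration(15.0, 1.5, TOTAL_UL)  # each primer, in uM
dntp_mm = final_concentration(10.0, 2.0, TOTAL_UL)    # total dNTPs, in mM
buffer_x = final_concentration(10.0, 5.0, TOTAL_UL)   # buffer strength, in "x"

# Cycling program from the text: (number of repeats, step durations in seconds)
program = [
    (1, [5 * 60]),       # 95 C, initial denaturation
    (35, [30, 30, 50]),  # 94 C, 55 C, 68 C
    (1, [10 * 60]),      # 68 C, final extension
]
hold_seconds = sum(repeats * sum(steps) for repeats, steps in program)

print(f"primer: {primer_um:.2f} uM each, dNTPs: {dntp_mm:.1f} mM total")
print(f"buffer: {buffer_x:.0f}x, programmed hold time: {hold_seconds / 60:.1f} min")
```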

Table 136 demonstrates the cloning steps of ORF targets. For example, FXYD3_T25_P14 and VSIG1_T6_P5 were cloned by PCR amplification of two overlapping fragments of the full length at step 1, followed by additional PCR at step 2 using both PCR fragments from step 1 as a template for generating the full length. VSIG1_T5_P4 was cloned using both PCR fragments generated at step 1 for digestion and direct ligation. AI216611_T1_P1 was cloned by performing nested PCR on the PCR product generated from step 1. A 5 μl aliquot of each of products No. 1, 4, 5, 8, 9, 10, 11, 12, 15, 16 and 17 (Table 136) was loaded onto a 1% agarose gel stained with ethidium bromide, electrophoresed in 1×TBE solution at 100V, and visualized with UV light. After verification of the expected size band, the remaining PCR product was processed for DNA purification using Qiaquick PCR purification kit (Qiagen™, Valencia, Calif., U.S.A., catalog number 28106). The extracted PCR products were digested with the appropriate restriction enzymes (New England Biolabs, Beverly, Mass., U.S.A.), as listed in Table 136. After digestion, DNAs were loaded onto a 1% agarose gel as described above. The expected band size was excised and extracted from the gel using QiaQuick™ Gel Extraction kit (Qiagen, catalog number: 28707).

The digested targets' ORF DNAs were ligated to EGFP_pIRESpuro3 vector using the LigaFast™ Rapid DNA Ligation System (Promega, catalog number: M8221). The resulting DNAs were transformed into competent E. coli bacteria DH5α (RBC Bioscience, Taipei, Taiwan, catalog number: RH816) according to manufacturer's instructions, then plated on LB-ampicillin agar plates for selection of recombinant plasmids, and incubated overnight at 37° C.

The following day, a number of colonies from each transformation that grew on the selective plates were taken for further analysis by streak-plating on another selective plate and by PCR using GoTaq ReadyMix (Promega, catalog number: M7122). Screening for positive clones was performed by PCR using a pIRESpuro3 vector specific primer and a gene specific primer (data not shown). After completion of all PCR cycles, half of the reaction was analyzed using 1% agarose gel as described above. After verification of the expected size band, 2 positive colonies from each ligation reaction were grown in 5 ml Terrific Broth supplemented with 100 μg/ml ampicillin, with shaking overnight at 37° C. Plasmid DNA was isolated from bacterial cultures using Qiaprep™ Spin Miniprep Kit (Qiagen, catalog number: 27106). Accurate cloning was verified by sequencing the inserts (Weizmann Institute, Rehovot, Israel). Upon verification of an error-free colony (i.e. no mutations within the ORF), recombinant plasmids were processed for further analyses.

The DNA sequences of the resulting VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 full length fused to EGFP are shown in FIGS. 56A-J. In FIGS. 56A-J, the gene specific sequence corresponding to the target's full length sequence is marked in bold, the EGFP sequence is in non-bold italic, and known SNPs/silent mutations are underlined. FIG. 56A presents the DNA sequence of FXYD3_T0_P0_EGFP (996bp) (SEQ ID NO:77); FIG. 56B presents the DNA sequence of FXYD3_T25_P14_EGFP (1083bp) (SEQ ID NO:78); FIG. 56C presents the DNA sequence of AI216611_T0_P0_EGFP (1371bp) (SEQ ID NO:79); FIG. 56D presents the DNA sequence of AI216611_T1_P1_EGFP (1332bp) (SEQ ID NO:80); FIG. 56E presents the DNA sequence of C1ORF32_T8_P8_EGFP (1533bp) (SEQ ID NO:81); FIG. 56F presents the DNA sequence of LOC253012_T4_P5_EGFP (2085bp) (SEQ ID NO:82); FIG. 56G presents the DNA sequence of ILDR1_T0_P3_EGFP (2373bp) (SEQ ID NO:83); FIG. 56H presents the DNA sequence of ILDR1_T2_P5_EGFP (2241bp) (SEQ ID NO:84); FIG. 56I presents the DNA sequence of VSIG1_T6_P5_EGFP (2082bp) (SEQ ID NO:85); FIG. 56J presents the DNA sequence of VSIG1_T5_P4_EGFP (2004bp) (SEQ ID NO:86).

The amino acid sequences of the resulting VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 full length fused to EGFP are shown in FIGS. 57A-J; the gene specific sequence corresponding to the full length sequence of the protein is marked in bold, the EGFP sequence is in non-bold italic, and amino acids modified due to known SNPs are underlined. FIG. 57A presents the amino acid sequence of FXYD3_P0_EGFP protein (331aa) (SEQ ID NO:87); FIG. 57B presents the amino acid sequence of FXYD3_P14_EGFP protein (360aa) (SEQ ID NO:88); FIG. 57C presents the amino acid sequence of AI216611_P0_EGFP protein (456aa) (SEQ ID NO:89); FIG. 57D presents the amino acid sequence of AI216611_P1_EGFP protein (443aa) (SEQ ID NO:90); FIG. 57E presents the amino acid sequence of C1ORF32_P8_EGFP protein (510aa) (SEQ ID NO:91); FIG. 57F presents the amino acid sequence of LOC253012_P5_EGFP protein (694aa) (SEQ ID NO:92); FIG. 57G presents the amino acid sequence of ILDR1_P3_EGFP protein (790aa) (SEQ ID NO:93); FIG. 57H presents the amino acid sequence of ILDR1_P5_EGFP protein (746aa) (SEQ ID NO:94); FIG. 57I presents the amino acid sequence of VSIG1_P5_EGFP protein (693aa) (SEQ ID NO:95); FIG. 57J presents the amino acid sequence of VSIG1_P4_EGFP protein (667aa) (SEQ ID NO:96).
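The DNA and protein lengths reported for the EGFP fusions can be cross-checked against each other. The sketch below assumes that each reported DNA length covers the coding sequence plus one stop codon, i.e. bp = 3 × (aa + 1); this relationship is an assumption that happens to fit the figures quoted above, not a statement from the text. Construct names follow the DNA figure legends.

```python
# construct name -> (DNA length in bp, protein length in aa), from FIGS. 56 and 57
CONSTRUCTS = {
    "FXYD3_T0_P0_EGFP": (996, 331),
    "FXYD3_T25_P14_EGFP": (1083, 360),
    "AI216611_T0_P0_EGFP": (1371, 456),
    "AI216611_T1_P1_EGFP": (1332, 443),
    "C1ORF32_T8_P8_EGFP": (1533, 510),
    "LOC253012_T4_P5_EGFP": (2085, 694),
    "ILDR1_T0_P3_EGFP": (2373, 790),
    "ILDR1_T2_P5_EGFP": (2241, 746),
    "VSIG1_T6_P5_EGFP": (2082, 693),
    "VSIG1_T5_P4_EGFP": (2004, 667),
}


def lengths_consistent(dna_bp, protein_aa):
    """True if the DNA length equals three bases per residue plus a stop codon."""
    return dna_bp == 3 * (protein_aa + 1)


results = {name: lengths_consistent(bp, aa) for name, (bp, aa) in CONSTRUCTS.items()}
for name, ok in results.items():
    print(f"{name}: {'consistent' if ok else 'mismatch'}")
```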

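TABLE 136
Full length cloning details

CGEN ID | Target name | Full length (aa) | PCR product No | DNA template | Primer ID | Primer sequence | Primer orientation | Restriction site
CGEN789 | FXYD3_T0_P0 | 87 | 1 | ovary 30, 39, 59 cDNA (Table 4) | 100-813 (SEQ ID NO: 254) | CTAGCTA GCCACC ATGCAGAAGGTGACCCTG | For | NheI
CGEN789 | FXYD3_T0_P0 | 87 | 1 | ovary 30, 39, 59 cDNA (Table 4) | 100-814 (SEQ ID NO: 255) | CGCGACCGGTCCGCTTTGGGCTGAGCCTGG | Rev | AgeI
CGEN789 | FXYD3_T25_P14 | 116 | 2 | lung 1, 19, 20, 37, 42 cDNA (Table 3) | 100-813 (SEQ ID NO: 254) | CTAGCTA GCCACC ATGCAGAAGGTGACCCTG | For | NheI
CGEN789 | FXYD3_T25_P14 | 116 | 2 | lung 1, 19, 20, 37, 42 cDNA (Table 3) | 100-843 (SEQ ID NO: 256) | CCTGTGTCCTCTTCATCTGTC | Rev | -
CGEN789 | FXYD3_T25_P14 | 116 | 3 | lung 1, 19, 20, 37, 42 cDNA (Table 3) | 100-842 (SEQ ID NO: 257) | GACAGATGAAGAGGACACAGG | For | -
CGEN789 | FXYD3_T25_P14 | 116 | 3 | lung 1, 19, 20, 37, 42 cDNA (Table 3) | 100-814 (SEQ ID NO: 255) | CGCGACCGGTCCGCTTTGGGCTGAGCCTGG | Rev | AgeI
CGEN789 | FXYD3_T25_P14 | 116 | 4 | PCR products No 2 + 3 above | 100-813 (SEQ ID NO: 254) | CTAGCTA GCCACC ATGCAGAAGGTGACCCTG | For | NheI
CGEN789 | FXYD3_T25_P14 | 116 | 4 | PCR products No 2 + 3 above | 100-814 (SEQ ID NO: 255) | CGCGACCGGTCCGCTTTGGGCTGAGCCTGG | Rev | AgeI
CGEN721 | AI216611_T0_P0 | 200 | 5 | lung 49 cDNA (Table 3) | 100-740 (SEQ ID NO: 258) | CTAGCTA GCCACC ATGAGGCCTCTGCCCAGCG | For | NheI
CGEN721 | AI216611_T0_P0 | 200 | 5 | lung 49 cDNA (Table 3) | 100-741 (SEQ ID NO: 259) | CGCGAATTCGACACTCAACATCTTCCAGCTC | Rev | EcoRI
CGEN721 | AI216611_T1_P1 | 199 | 6 | lung 4 cDNA (Table 3) | 100-738 (SEQ ID NO: 260) | AAGGCTGCATAGGAGCTG | For | -
CGEN721 | AI216611_T1_P1 | 199 | 6 | lung 4 cDNA (Table 3) | 100-919 (SEQ ID NO: 261) | CAATGAGTTGGAAATCAAGCCAC | Rev | -
CGEN721 | AI216611_T1_P1 | 199 | 7 | PCR product No 6 above | 100-740 (SEQ ID NO: 258) | CTAGCTA GCCACC ATGAGGCCTCTGCCCAGCG | For | NheI
CGEN721 | AI216611_T1_P1 | 199 | 7 | PCR product No 6 above | 100-919 (SEQ ID NO: 261) | CAATGAGTTGGAAATCAAGCCAC | Rev | -
CGEN721 | AI216611_T1_P1 | 199 | 8 | PCR product No 7 above | 100-740 (SEQ ID NO: 258) | CTAGCTA GCCACC ATGAGGCCTCTGCCCAGCG | For | NheI
CGEN721 | AI216611_T1_P1 | 199 | 8 | PCR product No 7 above | 100-836 (SEQ ID NO: 262) | CGCGACCGGTCCAAACCACTCATGGATTATCACAAGGCCCAGGGGGTTACCTTTGAGTTTGTGTCTTCTC | Rev | AgeI
CGEN754 | C1ORF32_T8_P8 | 254 | 9 | lung 44, 45, 48 cDNA (Table 3) | 100-746 (SEQ ID NO: 263) | CTAGCTA GCCACC ATGGATAGGGTCTTGCTGAG | For | NheI
CGEN754 | C1ORF32_T8_P8 | 254 | 9 | lung 44, 45, 48 cDNA (Table 3) | 100-694 (SEQ ID NO: 264) | CGCGAATTCGGGTAGAGAGGTAGACATTTC | Rev | EcoRI
CGEN702 | LOC253012_T4_P5 | 450 | 10 | IMAGE clone BD139906.1 | 100-765 (SEQ ID NO: 265) | GCGCTTCGAA GCCACC ATGTGGCTCAAGGTCTTCAC | For | BstBI
CGEN702 | LOC253012_T4_P5 | 450 | 10 | IMAGE clone BD139906.1 | 100-766 (SEQ ID NO: 266) | CGCGACCGGTCCCTCTGGATGGTCTTGCTGCTG | Rev | AgeI
CGEN770 | ILDR1_T0_P3 | 546 | 11 | ovary 19, 20, 27 cDNA (Table 4) | 100-780 (SEQ ID NO: 267) | CTAGCTA GCCACC ATGGCATGGCCCAAACTGCC | For | NheI
CGEN770 | ILDR1_T0_P3 | 546 | 11 | ovary 19, 20, 27 cDNA (Table 4) | 100-781 (SEQ ID NO: 268) | CGCGACCGGTCCAATGACCACACTCCTTCCACTA | Rev | AgeI
CGEN770 | ILDR1_T2_P5 | 502 | 12 | ovary 19, 20, 27 cDNA (Table 4) | 100-780 (SEQ ID NO: 267) | CTAGCTA GCCACC ATGGCATGGCCCAAACTGCC | For | NheI
CGEN770 | ILDR1_T2_P5 | 502 | 12 | ovary 19, 20, 27 cDNA (Table 4) | 100-781 (SEQ ID NO: 268) | CGCGACCGGTCCAATGACCACACTCCTTCCACTA | Rev | AgeI
CGEN768 | VSIG1_T6_P5 | 449 | 13 | lung 17 cDNA (Table 3) | 100-783 (SEQ ID NO: 269) | CTAGCTA GCCACC ATGGTGTTCGCATTTTGGAAG | For | NheI
CGEN768 | VSIG1_T6_P5 | 449 | 13 | lung 17 cDNA (Table 3) | 100-838 (SEQ ID NO: 270) | CTGGAGTTCAGCCTGCTGTCCATCAAGAG | Rev | -
CGEN768 | VSIG1_T6_P5 | 449 | 14 | lung 17 cDNA (Table 3) | 100-837 (SEQ ID NO: 271) | CTCTTGATGGACAGCAGGCTGAACTCCAG C | For | -
CGEN768 | VSIG1_T6_P5 | 449 | 14 | lung 17 cDNA (Table 3) | 100-782 (SEQ ID NO: 272) | CGCGACCGGTCCTGCCTTAACCACTCCCTTTTC | Rev | AgeI
CGEN768 | VSIG1_T6_P5 | 449 | 15 | PCR products No 13 + 14 above | 100-783 (SEQ ID NO: 269) | CTAGCTA GCCACC ATGGTGTTCGCATTTTGGAAG | For | NheI
CGEN768 | VSIG1_T6_P5 | 449 | 15 | PCR products No 13 + 14 above | 100-782 (SEQ ID NO: 272) | CGCGACCGGTCCTGCCTTAACCACTCCCTTTTC | Rev | AgeI
CGEN768 | VSIG1_T5_P4 | 423 | 16 | lung 4 cDNA (Table 3) | 100-783 (SEQ ID NO: 269) | CTAGCTA GCCACC ATGGTGTTCGCATTTTGGAAG | For | NheI
CGEN768 | VSIG1_T5_P4 | 423 | 16 | lung 4 cDNA (Table 3) | 100-785 (SEQ ID NO: 273) | CCTCAGTACT GAGGCACGAGCTGTG | Rev | ScaI
CGEN768 | VSIG1_T5_P4 | 423 | 17 | lung 4 cDNA (Table 3) | 100-784 (SEQ ID NO: 274) | CCTC AGTACT GAGGGTATGG | For | ScaI
CGEN768 | VSIG1_T5_P4 | 423 | 17 | lung 4 cDNA (Table 3) | 100-782 (SEQ ID NO: 272) | CGCGACCGGTCCTGCCTTAACCACTCCCTTTTC | Rev | AgeI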

Example 9 Determining Cell Localization of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3

In order to determine the cellular localization of the protein targets, they were cloned as EGFP (Enhanced Green Fluorescent Protein) fusion proteins. Protein localization was observed upon transient transfection (Chen et al., Molecular Vision 2002; 8; 372-388) using the confocal microscope. The cells were observed for the presence of fluorescent products 48 hours following transfection.

Determining cell localization of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 was done by transient transfection of the recombinant ORF-EGFP constructs which were described above.

The VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3-EGFP pIRESpuro3 constructs were subsequently transiently transfected into HEK-293T cells as follows:

HEK-293T (ATCC, CRL-11268) cells were plated on sterile glass coverslips, 13 mm diameter (Marienfeld, catalog number: 01 115 30), which were placed in a 6 well plate, using 2 ml pre-warmed DMEM [Dulbecco's modified Eagle's Media, Biological Industries (Beit Ha'Emek, Israel), catalog number: 01-055-1A]+10% FBS (Fetal Bovine Serum)+4 mM L-Glutamine. 500,000 cells per well were transfected with 2 μg of the DNA construct using 6 μl FuGENE 6 reagent (Roche, catalog number: 11-814-443-001) diluted into 94 μl DMEM. The mixture was incubated at room temperature for 15 minutes. The complex mixture was added dropwise to the cells and swirled. Cells were placed in an incubator maintained at 37° C. with 5% CO2 content.

48 hours post transient transfection the cells were further processed for analysis in confocal microscopy. The cover slips were washed 3 times in phosphate buffered saline (PBS) and fixed for 15 minutes with 3.7% or 1% paraformaldehyde (PFA) (Sigma, catalog number: P-6148). After 2 washes in PBS, the fixed coverslips were glued to a slide using mounting solution (Sigma, catalog number: G0918) and cells were observed for the presence of fluorescent product using confocal microscope. The results are presented in FIG. 58A-F.

FIG. 58A demonstrates that the AI216611_P0_EGFP (SEQ ID NO:89) and AI216611_P1_EGFP (SEQ ID NO:90) fused proteins localize to the cell membrane upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

FIG. 58B demonstrates that the FXYD3_P0_EGFP (SEQ ID NO:87) and FXYD3_P14_EGFP (SEQ ID NO:88) fused proteins localize to the cell membrane upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

FIG. 58C demonstrates that the C1ORF32_P8_EGFP (SEQ ID NO:91) fused protein localizes to the cell membrane, the endoplasmic reticulum (ER) membrane and cell junctions upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

FIG. 58D demonstrates that the LOC253012_P5_EGFP (SEQ ID NO:92) fused protein localizes to the cell membrane and the endoplasmic reticulum (ER) membrane upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

FIG. 58E demonstrates that the VSIG1_P5_EGFP (SEQ ID NO:95) and VSIG1_P4_EGFP (SEQ ID NO:96) fused proteins localize to the nuclear cell membrane and the endoplasmic reticulum membrane upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

FIG. 58F demonstrates that the ILDR1_P3_EGFP (SEQ ID NO:93) and ILDR1_P5_EGFP (SEQ ID NO:94) fused proteins localize to the cell membrane and the endoplasmic reticulum membrane upon expression in HEK 293T cells. The image was obtained using the 40× objective of the confocal microscope.

Example 10 Cloning and Expression of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 Extra Cellular Domain (ECD) Fused to Mouse Fc

The purpose of this analysis was to clone the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECDs fused via its corresponding C′ terminus to mouse Fc (mFc), and to express the fused ECDs in HEK293T cells (ATCC-CRL-11268), in order to be further used for antibody production as well as for functional assessment of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECDs.

The coordinates of the cloned ECD are described in table 137:

TABLE 137

CGEN ID | Recombinant protein name | Transcript No. | Protein No. | Full length (aa) | ECD Coordinates (aa) | SEQ ID
CGEN789 | FXYD3 | T25 (SEQ ID NO: 66) | P14 (SEQ ID NO: 72) | 116 | 1-63 | SEQ ID NO: 297
CGEN721 | AI216611 | T0 (SEQ ID NO: 41) | P0 (SEQ ID NO: 43) | 200 | 1-145 | SEQ ID NO: 298
CGEN754 | C1ORF32 | T8 (SEQ ID NO: 45) | P8 (SEQ ID NO: 48) | 254 | 1-184 | SEQ ID NO: 299
CGEN702 | LOC253012 | T4 (SEQ ID NO: 26) | P5 (SEQ ID NO: 36) | 450 | 1-335 | SEQ ID NO: 300
CGEN770 | ILDR1 | T0 (SEQ ID NO: 17) | P3 (SEQ ID NO: 22) | 546 | 51-160 | SEQ ID NO: 301
CGEN768 | VSIG1 | T6 (SEQ ID NO: 7) | P5 (SEQ ID NO: 13) | 449 | 26-293 | SEQ ID NO: 302

The cloning of the fusion proteins (ECD_mFc) was done in two steps:

1. Cloning of ECD to pIRESpuro3.

2. Subcloning of the mouse Fc IgG2a in frame to the C′ terminus of the ECD previously cloned into pIRESpuro3 in step 1.

The cloning of ECD to pIRESpuro3 was carried out as follows:

Cloning of the ECD for each one of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 was done by PCR amplification of the partial amino acid sequence delimiting its ECD, as described in Table 137, using its full length sequence as a template and the primers listed in Table 138.

TABLE 138
ECD cloning details

CGEN ID | Candidate name | Primer ID | Primer sequence | Primer orientation | Restriction site
CGEN789 | FXYD3 | 100-813 (SEQ ID NO: 254) | CTAGCTA GCCACC ATGCAGAAGGTGACCCTG | For | NheI
CGEN789 | FXYD3 | 100-852 (SEQ ID NO: 275) | CGCGGATCC CCAGTCTAGGAGGAAAGTGG | Rev | BamHI
CGEN721 | AI216611 | 100-740 (SEQ ID NO: 258) | CTAGCTA GCCACC ATGAGGCCTCTGCCCAGCG | For | NheI
CGEN721 | AI216611 | 100-850 (SEQ ID NO: 276) | CGCGGATCC GTCTTCATAGAGGATCTCAG | Rev | BamHI
CGEN754 | C1ORF32 | 100-746 (SEQ ID NO: 263) | CTAGCTA GCCACC ATGGATAGGGTCTTGCTGAG | For | NheI
CGEN754 | C1ORF32 | 100-851 (SEQ ID NO: 277) | CGCGGATCC CATAATCTCCACAGCAAAAC | Rev | BamHI
CGEN702 | LOC253012 | 100-789 (SEQ ID NO: 278) | AACCGGT GCCACC ATGTGGCTCAAGGTCTTCAC | For | AgeI
CGEN702 | LOC253012 | 100-854 (SEQ ID NO: 279) | CGCGGATCC TTTTCCTTTCTGTGCAAGCT | Rev | BamHI
CGEN770 | ILDR1 | 100-873 (SEQ ID NO: 280) | GCGTTCGAA GCCCAGCTCCAGGACGTGGTG | For | BstBI
CGEN770 | ILDR1 | 100-853 (SEQ ID NO: 281) | CGCGGATCC TTCCTTATCGGGGTCTCCTG | Rev | BamHI
CGEN768 | VSIG1 | 100-867 (SEQ ID NO: 282) | GCGCTTCGAA ATCCCAGACGGTTTCGTG | For | BstBI
CGEN768 | VSIG1 | 100-855 (SEQ ID NO: 283) | CGCGGATCC TGGATGTGAAGAAGTGAGAT | Rev | BamHI

In Table 138 above, the bold letters represent the gene specific sequence, while the restriction site extensions utilized for cloning purposes are in italic and the Kozak sequence is underlined.
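Each primer in Table 138 is expected to carry the recognition sequence of the restriction enzyme listed for it. The sketch below checks this for four of the primers, with the internal spaces of the table removed. The recognition sequences are the standard ones for these enzymes; the data structures and function names are illustrative only.

```python
# Standard recognition sequences (5' to 3') of the enzymes used in the check
RECOGNITION_SITES = {
    "NheI": "GCTAGC",
    "BamHI": "GGATCC",
    "AgeI": "ACCGGT",
    "BstBI": "TTCGAA",
}

# (primer ID, primer sequence, listed restriction site), from Table 138
PRIMERS = [
    ("100-813", "CTAGCTAGCCACCATGCAGAAGGTGACCCTG", "NheI"),
    ("100-852", "CGCGGATCCCCAGTCTAGGAGGAAAGTGG", "BamHI"),
    ("100-789", "AACCGGTGCCACCATGTGGCTCAAGGTCTTCAC", "AgeI"),
    ("100-867", "GCGCTTCGAAATCCCAGACGGTTTCGTG", "BstBI"),
]


def site_position(sequence, enzyme):
    """Return the 0-based index of the enzyme's recognition site, or -1 if absent."""
    return sequence.find(RECOGNITION_SITES[enzyme])


positions = {pid: site_position(seq, enzyme) for pid, seq, enzyme in PRIMERS}
for pid, seq, enzyme in PRIMERS:
    print(f"{pid}: {enzyme} site at index {positions[pid]}")
```

The same lookup can be extended to the remaining primers of the table by adding rows to the hypothetical `PRIMERS` list.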

The PCR products were purified and digested with the appropriate restriction enzymes as described in Table 138. PCR products for FXYD3, AI216611, C1ORF32 and LOC253012 were ligated into pIRESpuro3, while PCR products for VSIG1 and ILDR1 were ligated into IL6sp pIRESpuro3 in order to increase their secretion. The ligation mixture was transformed into DH5α competent cells. Positive transformants were screened and verified by DNA sequencing.

Cloning of ECD-mFc pIRESpuro3

Mouse Fc (IgG2a) (Accession-CAA49868 aa 237-469) protein sequence followed by a TEV cleavage site sequence was codon optimized to boost protein expression in a mammalian system. The optimized sequence was synthesized by GeneArt (Germany) with a flanking BamHI restriction site at the N′ terminus and a NotI restriction site at the C′ terminus. The DNA fragment was digested with BamHI/NotI and ligated in frame into ECD_pIRESpuro3 constructs previously digested with the same enzymes to give ECD_mFc pIRESpuro3. The ligation mixture was transformed into DH5α competent cells. Positive transformants were screened and verified by DNA sequencing.

The nucleotide sequences of the resulting ECD_mFc ORFs are shown in FIGS. 59A-F; the gene specific sequence corresponding to the ECD sequence is marked in bold, the TEV cleavage site sequence is underlined, the mFc sequence is in non-bold italic and the IL6sp sequence is in bold italic. FIG. 59A shows the FXYD3_T25_P14_ECD_mFc DNA sequence (924bp) (SEQ ID NO:97); FIG. 59B shows the AI216611_T0_P0_ECD_mFc DNA sequence (1170bp) (SEQ ID NO:98); FIG. 59C shows the C1ORF32_T8_P8_ECD_mFc DNA sequence (1287bp) (SEQ ID NO:99); FIG. 59D shows the LOC253012_T4_P5_ECD_mFc DNA sequence (1740bp) (SEQ ID NO:100); FIG. 59E shows the ILDR1_T0_P3_ECD_mFc DNA sequence (1167bp) (SEQ ID NO:101); and FIG. 59F shows the VSIG1_T6_P5_ECD_mFc DNA sequence (1641bp) (SEQ ID NO:102).

The sequences of the resulting ECD_mFc fusion proteins are shown in FIGS. 60A-60F; the gene specific sequence corresponding to the ECD sequence is marked in bold, the TEV cleavage site sequence is underlined, the mFc sequence is in non-bold italic and the IL6sp sequence is in bold italic. FIG. 60A shows the FXYD3_T25_P14_ECD_mFc amino acid sequence (307aa) (SEQ ID NO:103); FIG. 60B shows the AI216611_T0_P0_ECD_mFc amino acid sequence (389aa) (SEQ ID NO:104); FIG. 60C shows the C1ORF32_T8_P8_ECD_mFc amino acid sequence (428aa) (SEQ ID NO:105); FIG. 60D shows the LOC253012_T4_P5_ECD_mFc amino acid sequence (579aa) (SEQ ID NO:106); FIG. 60E shows the ILDR1_T0_P3_ECD_mFc amino acid sequence (388aa) (SEQ ID NO:107); and FIG. 60F shows the VSIG1_T6_P5_ECD_mFc amino acid sequence (546aa) (SEQ ID NO:108).
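The reported fusion lengths can be related to the ECD coordinates of Table 137 and to the mouse Fc span (aa 237-469 of accession CAA49868). The sketch below subtracts the ECD and Fc spans from each fusion length. The remainder presumably corresponds to the TEV cleavage site, residues encoded by the cloning sites and, for ILDR1 and VSIG1, the IL6 signal peptide; that interpretation is an assumption, and only the arithmetic is taken from the figures in the text.

```python
MFC_FIRST, MFC_LAST = 237, 469  # mouse IgG2a Fc residues, from the text

# construct -> (ECD first residue, ECD last residue, ECD_mFc fusion length in aa)
FUSIONS = {
    "FXYD3_T25_P14": (1, 63, 307),
    "AI216611_T0_P0": (1, 145, 389),
    "C1ORF32_T8_P8": (1, 184, 428),
    "LOC253012_T4_P5": (1, 335, 579),
    "ILDR1_T0_P3": (51, 160, 388),   # cloned into the IL6sp vector
    "VSIG1_T6_P5": (26, 293, 546),   # cloned into the IL6sp vector
}


def span(first, last):
    """Number of residues in an inclusive coordinate range."""
    return last - first + 1


def unaccounted_residues(ecd_first, ecd_last, fusion_length):
    """Fusion residues not explained by the ECD span or the Fc span."""
    return fusion_length - span(ecd_first, ecd_last) - span(MFC_FIRST, MFC_LAST)


remainders = {name: unaccounted_residues(*vals) for name, vals in FUSIONS.items()}
for name, extra in remainders.items():
    print(f"{name}: {extra} residues beyond ECD + Fc")
```

The remainder is the same for the four constructs cloned into pIRESpuro3 and larger by a constant amount for the two constructs cloned into IL6sp pIRESpuro3, which is what an added signal peptide would be expected to produce.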

To generate ECD-mFc expressing cells, HEK-293T cells were transfected with the above described constructs corresponding to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 extra cellular domain fused to mouse Fc. Stable pools were generated as follows: 48 hrs post transfection, the cells were trypsinized and transferred to a T75 flask containing selection medium (DMEM, 10% FCS and 5 μg/ml puromycin) to obtain a stable pool. Media was changed every 3 to 4 days until colony formation.

To verify the identity of cells, genomic PCR was performed, indicating the correct sequences integrated into the cell genome (data not shown).

Cell-deprived medium was collected and purified with Protein A-Sepharose beads (Amersham catalog number 17-5280-04) as follows: 1 ml of cell-deprived medium was incubated with 50 μl Protein A-Sepharose beads for 45 minutes at room temperature. At the end of the incubation, proteins were eluted from the bead pellet with 50 μl sample buffer containing 100 mM Citrate Phosphate pH 3.5 and 10 mM DTT. The samples were boiled for 3 minutes and 25 μl were loaded on a 12% NuPAGE Bis Tris gel (Invitrogen, catalog number NPO342). The proteins were transferred to a nitrocellulose membrane, which was blocked with 10% low fat milk in PBST (PBS supplemented with 0.05% Tween-20). The membrane was then blotted for 1 hour at room temperature with Goat anti mouse IgG Fc fragment HRP (Jackson, catalog number 115-035-206) (1:40,000 in blocking solution). Following incubation with ECL solution (Amersham Biosciences, Catalog No. RPN2209), the membrane was exposed to film.

FIG. 61 shows the results of a western blot on expressed FXYD3_ECD_mFc (SEQ ID NO:103), AI216611 ECD_mFc (SEQ ID NO:104), C1ORF32_ECD_mFc (SEQ ID NO:105), LOC253012_ECD_mFc (SEQ ID NO:106), ILDR1_ECD_mFc (SEQ ID NO:107), VSIG1_ECD_mFc (SEQ ID NO:108) according to the present invention.

The lanes are as follows: lane 1-Molecular weight markers (Amersham, full range rainbow, catalog number RPN800); lane 2-LOC253012_ECD_mFc (SEQ ID NO:106); lane 3-FXYD3_ECD_mFc (SEQ ID NO:103); lane 4-AI216611 ECD_mFc (SEQ ID NO:104); lane 5-C1ORF32_ECD_mFc (SEQ ID NO:105); lane 6-ILDR1_ECD_mFc (SEQ ID NO:107); lane 7-VSIG1_ECD_mFc (SEQ ID NO:108).

Example 11 Protein Production of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 Extra Cellular Domain (ECD) Fused to Mouse Fc

To produce the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECDs fused to mouse Fc, pools of HEK293T cells stably transfected with the corresponding constructs described hereinabove were used. The transfected cells, usually maintained in 10% serum supplemented medium, were transferred into serum free medium (EX-CELL293, SAFC) supplemented with 4 mM glutamine and selection antibiotics (5 μg/ml puromycin), and grown in suspension in shake flasks at 37° C., with agitation. The culture volume was increased by sequential dilutions until a production phase of 3-4 days, which was carried out in 2 L spinner flasks. Medium from the spinners was harvested, cleared of cells by centrifugation, filtered through a 0.22 μm filter and kept at −20° C.

The VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 ECD fused to mouse Fc were purified using nProtein A-affinity chromatography as described below.

Harvests were concentrated approximately 10 fold using a PALL ultrafiltration system with two 10 kD cassettes. The concentrate was then adjusted to pH 7.5 by the addition of 5M NaOH and filtered through a 0.2 μm Stericup filter.

Purification was carried out using an AKTA Explorer (GE Healthcare). 2 ml of nProtein A Sepharose™ Fast Flow resin (cat#17-5280-02) were washed on a Poly-prep chromatography column under vacuum with 10 column volumes (CV) of 70% ethanol and 10 CV of WFI (Sterile Water for Irrigation (TEVA)), followed by 10 CV of buffer A. The 2 ml of resin were transferred into two 500 ml tubes (1 ml each) and the concentrated harvest was added. The tubes were incubated overnight at 4° C. on a roller to allow binding of the protein. Bound resin was then transferred and packed under constant flow into an XK16 column (GE Healthcare, cat#18-8773-01). The column was washed with 20 CV of buffer A (100 mM Tris pH 7.4) and elution was carried out in one step using 100% buffer B (Citrate/Phosphate pH 3.0). The fractions were titrated with 12.5% (v/v) buffer C (2M Tris pH 8.5) to adjust the pH to ˜7.5 and pooled.
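For illustration only, the wash and neutralization volumes implied by the above protocol can be computed as follows. The sketch assumes that one column volume equals the 2 ml of packed resin and that the 12.5% (v/v) of buffer C is calculated relative to the volume of the eluted fraction; both readings are assumptions, and the function names are hypothetical.

```python
RESIN_ML = 2.0  # packed nProtein A resin; taken here as one column volume (CV)


def wash_volume_ml(column_volumes, resin_ml=RESIN_ML):
    """Liquid volume corresponding to a given number of column volumes."""
    return column_volumes * resin_ml


def neutralizer_volume_ml(fraction_ml, percent_v_v=12.5):
    """Buffer C volume, assuming the percentage refers to the fraction volume."""
    return fraction_ml * percent_v_v / 100.0


if __name__ == "__main__":
    print("10 CV wash:", wash_volume_ml(10), "ml")
    print("20 CV wash:", wash_volume_ml(20), "ml")
    # Hypothetical 2 ml elution fraction
    print("Buffer C for a 2 ml fraction:", neutralizer_volume_ml(2.0), "ml")
```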

The final buffer was exchanged to DPBS (Dulbecco's Phosphate buffered saline pH 7.4, w/o Ca, w/o Mg) using a 53 ml HiPrep™ (GE Healthcare, cat#17-5087-01) desalting column. The protein was filtered through a 0.22 μm filter, aliquoted under sterile conditions, and stored at −80° C.

The final protein concentration was determined by BCA total protein assay and protein was analyzed by coomassie stained reducing SDS/PAGE (data not shown). Endotoxin level was determined by colorimetric LAL assay (Limulus Amebocyte Lysate, QCL-1000, Cambrex). The identities of the specific proteins were verified by MS (at the Smoler Proteomics Center, Technion, Haifa, data not shown).

The resulting protein analyses are summarized in Table 139.

TABLE 139

Protein                                           Concentration (mg/ml)   Purity (%)   Endotoxins (EU/mg)
C1ORF32-P8-ECD-mFc (SEQ ID NO: 105)               0.9                     >90          1.04
IL6-VSIG1 P5 ECD aa26-end mFc (SEQ ID NO: 108)    0.94                    >95          0.95
FXYD3-T25P14-ECD-mFc (SEQ ID NO: 103)             1.1                     >80          0.14
AI216611-T0P0-mFc (SEQ ID NO: 104)                1.6                     >94          0.72
IL6 ILDR1 ECD aa50-160 mFc (SEQ ID NO: 107)       1                       85           <0.2
LOC253012-P5-ECD-mFc (SEQ ID NO: 106)             1                       >95          1.45
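As a non-limiting illustration of how the values of Table 139 may be combined, the endotoxin load per unit volume of each preparation is the product of the protein concentration (mg/ml) and the endotoxin level (EU/mg). The following sketch uses the table values as given; the ILDR1 entry is omitted because its endotoxin level is reported only as an upper bound (<0.2 EU/mg). The function name is hypothetical.

```python
# (concentration in mg/ml, endotoxin in EU/mg), taken from Table 139
TABLE_139 = {
    "C1ORF32-P8-ECD-mFc": (0.9, 1.04),
    "IL6-VSIG1 P5 ECD aa26-end mFc": (0.94, 0.95),
    "FXYD3-T25P14-ECD-mFc": (1.1, 0.14),
    "AI216611-T0P0-mFc": (1.6, 0.72),
    "LOC253012-P5-ECD-mFc": (1.0, 1.45),
}


def endotoxin_eu_per_ml(conc_mg_per_ml, endotoxin_eu_per_mg):
    """Endotoxin per ml of preparation = (mg protein/ml) x (EU/mg protein)."""
    return conc_mg_per_ml * endotoxin_eu_per_mg


if __name__ == "__main__":
    for name, (conc, eu_mg) in TABLE_139.items():
        print(f"{name}: {endotoxin_eu_per_ml(conc, eu_mg):.3f} EU/ml")
```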

Example 12 Binding of the ECDs Fc-Fused Proteins of the Invention to Activated T Cells

In order to examine the ability of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 Fc-fused ECDs described above to bind a putative counter-receptor on T cells, these Fc-fused ECDs were tested on resting or activated T cells. Purified T cells were activated with ConA (Sigma Aldrich, Cat # C5275), followed by incubation with the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 Fc-fused ECDs and analyzed by flow cytometry.

T cells were purified from whole blood by negative selection using RosetteSep™ Human T Cell Enrichment Cocktail (StemCell Technologies, CAT #15061). This resulted in a population of T (CD3+) cells with a purity of ˜90%. Purified T cells (1×10⁵) were cultured for 48 hours in 100 μl of complete RPMI 1640 medium containing 10% FBS, either without any activation or activated with ConA (Concanavalin A, 10 μg/ml, Sigma Aldrich, Cat # C5275). Cultures were harvested and stained with the ECDs Fc-fused proteins for 1 hour at 4° C. (VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 ECDs fused to mouse IgG2 Fc). The bound proteins were detected with FITC-conjugated F(ab)2 goat anti-mouse Fc for half an hour at 4° C. (Jackson ImmunoResearch Laboratories, CAT #115-096-071). Samples were analyzed using a FACSCalibur (BD Immunocytometry Systems) and CellQuest software.

FIGS. 62A-D present the binding of the ECDs Fc-fused proteins (VSIG1 (SEQ ID NO:108), LOC253012 (SEQ ID NO:106), AI216611 (SEQ ID NO:104) or C1ORF32 (SEQ ID NO:105)) to resting T cells or T cells activated with Con A for different periods of time. Primary human T cells from three different donors were cultured for a total of 48 hours in the absence of stimulus (0 hrs) or in the presence of Con A, which was added to a final concentration of 10 μg/ml for the last 6, 18, 24 or 48 hours of culture (T cells from donor 5 were cultured with Con A for 0, 6, 18 and 24 hrs, while T cells from donors 6 & 7 were cultured for 0, 6, 24 and 48 hrs). Cells were then harvested and incubated with 10 μg/ml of the indicated ECDs Fc-fused proteins. FIG. 62A shows the binding results for Fc-fused VSIG1 ECD; FIG. 62B shows the binding results for Fc-fused LOC253012 ECD; FIG. 62C shows the binding results for Fc-fused C1ORF32 ECD; FIG. 62D shows the binding results for Fc-fused AI216611 ECD; and FIG. 62E shows the binding results for Fc-fused FXYD3 ECD. The percentage of positive cells was determined as the difference between the percentage of positive cells obtained with the indicated protein and the percentage of positive cells obtained with FITC-conjugated F(ab)2 goat anti-mouse Fc alone. FIG. 63 presents the dose response of the binding of B7-like proteins to activated T cells. Purified T cells were cultured for 48 hours. Con A was added for the last 24 hours. Cells were then harvested and stained with increasing concentrations (3, 6, 12, 25 and 50 μg/ml) of Fc-fused VSIG1, LOC253012, C1ORF32, AI216611 or ILDR1 ECDs. As a negative control, mouse IgG2a was used at the same concentrations.
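The background subtraction used to determine the percentage of positive cells can be sketched as follows. This is a minimal illustration only: the input percentages are hypothetical rather than measured data, and clamping negative differences to zero is an assumption not stated in the protocol.

```python
def net_percent_positive(with_protein_pct, secondary_only_pct):
    """Percent positive cells after subtracting the secondary-antibody-only background.

    Both inputs are percentages (0-100) of gated cells scored as FITC positive.
    Negative differences are clamped to 0 (an assumption made for this sketch).
    """
    for value in (with_protein_pct, secondary_only_pct):
        if not 0.0 <= value <= 100.0:
            raise ValueError("percentages must lie between 0 and 100")
    return max(0.0, with_protein_pct - secondary_only_pct)


if __name__ == "__main__":
    # Hypothetical readings: 30% positive with an Fc-fused ECD plus secondary
    # antibody, 5% positive with the secondary antibody alone.
    print(net_percent_positive(30.0, 5.0))
```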

The results presented in FIGS. 62A-D and 63 demonstrate binding of all the ECDs Fc-fused proteins tested (VSIG1, ILDR1, LOC253012, AI216611 or C1ORF32 ECDs fused to mouse IgG2 Fc, SEQ ID NO:108, 107, 106, 104, or 105, respectively), at binding levels above those of the negative control, mouse IgG2a (R&D Systems, CAT # MAB003), used as isotype control. Substantial binding to T cells stimulated with ConA was detected for Fc-fused VSIG1 ECD and for LOC253012 ECD-Fc. Fc-fused ECDs of C1ORF32 and AI216611 showed weaker binding to these cells, as can be seen from FIGS. 62A-D and 63. Each protein was found to bind a certain percentage of activated T cells. The ranking of binding levels was as follows: VSIG1>LOC253012>ILDR1=AI216611>C1ORF32. None of the proteins bound resting T cells (i.e., 0 hrs of ConA in FIGS. 62A-D).

Effect of the ECDs Fc-Fused Proteins of the Invention on T Cells Activation.

In order to test potential costimulatory and/or coinhibitory activity of the soluble proteins of the invention (VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 ECDs fused to mouse IgG2 Fc, SEQ ID NO:108, 107, 106, 104, 103, or 105, respectively) on T cell proliferation and IL-2 secretion, human T cells were cultured in the presence of anti-CD3 (clone OKT3, eBioscience, CAT #16-0037-85) and the B7-like proteins of the invention, described above. Recombinant human B7-1 protein (R&D Systems, CAT #140-B1) was used as a positive control for costimulatory activity. Recombinant mouse B7-H4 protein (R&D Systems, CAT #4206-B7) was used as a positive control for coinhibitory activity.

Flat-bottom 96-well plates were first coated at 4° C. overnight with 3 μg/ml of anti-CD3 mAb (clone OKT3) and subsequently coated with the indicated concentrations of human B7-1 (R&D, 3 μg/ml), mouse B7-H4 (R&D, 10 μg/ml) or the ECDs Fc-fused proteins of the invention (VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 ECDs fused to mouse IgG2 Fc) for 4 h at 37° C. Coated plates were washed with PBS three times before seeding of the cells. Human T cells were purified from whole blood as described above, and were cultured in the pre-coated 96-well plates (1×10⁵ cells/well) in 250 μl of complete RPMI 1640 medium containing 10% FBS for 48 hrs. T cell proliferation was determined by BrdU incorporation using the Cell proliferation ELISA, BrdU (colorimetric) (Roche). Cells were labeled with BrdU labeling reagent at a final concentration of 100 μM for the last 18 hours. The plates were then centrifuged (at 300 g, for 10 min), and supernatants were aspirated and stored at −20° C. for subsequent IL-2 determination using a Human IL-2 ELISA (Diaclone, CAT #850.010 096). BrdU incorporation was measured according to the instructions of the manufacturer of the Cell proliferation ELISA, BrdU (colorimetric) (Roche, CAT #11-647-229).

FIGS. 64A-B present the effect of the ECDs Fc-fused proteins of the invention on T cell proliferation or IL-2 secretion, upon activation with anti-CD3 Ab. FIG. 64A shows the levels of BrdU incorporation. FIG. 64B shows the levels of IL-2 secretion.

The results, presented in FIG. 64A-B, indicate that none of the ECD-Fc fused proteins VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 ECDs fused to mouse IgG2 Fc, showed costimulatory activity. The positive control, B7-1, showed a strong costimulatory activity, as expected. Fc-fused ILDR1_ECD and Fc-fused AI216611 ECD appear to have coinhibitory activity, since they inhibited cell proliferation similarly to B7-H4, in comparison to that obtained in the presence of the negative control: mouse IgG2a (FIG. 64A). However, no significant effect was observed on IL-2 secretion of any of the ECD-Fc fused proteins, VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 Fc-fused ECDs (FIG. 64B).

Example 13 Binding of the ECDs Fc-Fused Proteins of the Invention to Lymphocytes and to CD4 Positive Cells

In order to further examine the ability of the VSIG1, ILDR1, LOC253012, AI216611, FXYD3 and C1ORF32 Fc-fused ECDs to bind a putative counter-receptor on T cells, these Fc-fused ECDs were tested first on lymphocytes. PBMCs were prepared from human peripheral blood, in FACS buffer at 1×10e7/ml. Fc blocker (hIgG (16D10), lot#080706, 1.3 mg/ml) at 30 μg/ml was added and cells were incubated with the blocker on ice for 30 min. Fusion proteins were added at 1 μg/10e6 per stain on ice for 30 min. 2nd Ab was added at 1 μg/100 μl/stain for 25-30 min (G@mIgG-Fc-FITC: Jackson Immunol Lab, 1 mg/ml, code#115-096-071, lot#71453, 1.0 mg/ml, used at 1 μg/stain). Cells were washed with the buffer at each step outlined above. The binding was analyzed by flow cytometry.
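By way of illustration, the volume of Fc blocker stock (1.3 mg/ml) needed to reach the 30 μg/ml working concentration follows from the usual dilution relation C1·V1 = C2·V2. The sketch below is not part of the described protocol; the 100 μl staining volume used in the example call is a hypothetical value chosen for demonstration.

```python
def stock_volume_ul(stock_ug_per_ml, final_ug_per_ml, final_volume_ul):
    """Volume of stock (ul) needed so that the final volume has the final concentration.

    Uses C1 * V1 = C2 * V2 with both concentrations in ug/ml.
    """
    if final_ug_per_ml > stock_ug_per_ml:
        raise ValueError("final concentration exceeds stock concentration")
    return final_volume_ul * final_ug_per_ml / stock_ug_per_ml


if __name__ == "__main__":
    # 1.3 mg/ml stock = 1300 ug/ml; 30 ug/ml final; hypothetical 100 ul stain volume
    print(round(stock_volume_ul(1300.0, 30.0, 100.0), 2), "ul of Fc blocker stock")
```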

FIG. 65 illustrates the binding of the Fc-fused ECDs of VSIG1, ILDR1, LOC253012, AI216611, FXYD3 or C1ORF32 to lymphocytes. As can be seen from FIG. 65, C1ORF32, AI216611 and ILDR1 bind to a counterpart expressed on lymphocytes.

Next, binding of the VSIG1, ILDR1, LOC253012, AI216611, FXYD3 and C1ORF32 Fc-fused ECDs to CD4+ cells was examined. Fc blocker (hIgG (16D10), lot#080706, 1.3 mg/ml) at 30 μg/ml was added and cells were incubated with the blocker on ice for 30 min. Fusion proteins were added at 1 μg/10e6 per stain on ice for 30 min. 2nd Ab was added at 1 μg/100 μl/stain for 25-30 min (G@mIgG-Fc-FITC: Jackson Immunol Lab, 1 mg/ml, code#115-096-071, lot#71453, 1.0 mg/ml, used at 1 μg/stain). @CD4 (m@hCD4-APC: BD, cat#555349, lot#44331) was added at 20 μl per stain, on ice for 30 min.

Cells were washed with the buffer at each step outlined above. The binding was analyzed by flow cytometry.

FIG. 66 illustrates the binding of the Fc-fused ECDs of ILDR1, C1ORF32 and AI216611 to CD4+ cells.

Example 14 Effect of the ECDs Fc-Fused Proteins of the Invention on T Cell Activation

In order to test potential costimulatory and/or coinhibitory activity of the B7-like proteins of the invention, the effect of the VSIG1, ILDR1, LOC253012, AI216611 or C1ORF32 ECDs fused to mouse IgG2 Fc on T cell proliferation was tested. T cells were purified from whole blood by positive selection using CD3 microbeads (microbeads conjugated to monoclonal anti-human CD3 antibodies, isotype: mouse IgG2a; MACS Whole Blood CD3 Microbeads #130-090-874). Dynabeads are coated with CD3+/−B7 with M-450 Epoxy Dynabeads (Invitrogen cat. No. 140.11). For activation of CD3 T cells, purified CD3 T cells are stimulated with the CD3+CD28 coated beads at 1:1 or 1:05 ratio for various time points as needed. The cells were seeded at 2×10e5 per well in the presence or absence of CD3+CD28 (2 μg/ml each)-coated beads and cell proliferation was measured after 72 hours by tritium-thymidine incorporation. The results are shown in FIG. 67. “CD3” in FIG. 67 means CD3 only, without the presence of a costimulatory or coinhibitory molecule; “CD3+B7.2” means CD3+a known B7 stimulatory control, B7.2; “CD3+B7H4” means CD3 and B7H4, a known B7 inhibitory control; “CD3+B7H3” means CD3 and B7H3, a known B7 stimulatory protein; “CD3+702” means CD3+LOC253012-ECD-Fc fused (SEQ ID NO:106); “CD3+721” means CD3+AI216611-ECD-Fc fused (SEQ ID NO:104); “CD3+754” means CD3+C1ORF32-ECD-Fc fused (SEQ ID NO:105); “CD3+768” means CD3+VSIG1-ECD-Fc fused (SEQ ID NO:108); “CD3+770” means CD3+ILDR1-ECD-Fc fused (SEQ ID NO:107); “CD3+789” means CD3+FXYD3-ECD-Fc fused (SEQ ID NO:103).

As can be seen in FIG. 67, LOC253012-ECD-Fc, AI216611-ECD-Fc, VSIG1-ECD-Fc and FXYD3-ECD-Fc had an inhibitory effect on T cells compared to CD3 alone in 3 different experiments (FIGS. 67 A, B, and C).

Example 15 Interaction of the ECDs-Fc Fused Proteins of the Invention with Resting B Cells, Activated B Cells, and B Cell Derived Lymphoma Cell Lines

Following demonstration of binding of the proteins of the invention to lymphocytes (Examples 12 and 13, herein), the ability of the soluble proteins of the invention to bind to B cells was examined.

PBMCs were prepared from human peripheral blood, in FACS buffer at 1×10e7/ml. Fc blocker (hIgG (16D10), lot#080706, 1.3 mg/ml) at 30 μg/ml was added and cells were incubated with the blocker on ice for 30 min. Fusion proteins of the invention, ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108), were added at 1 μg/10e6 per stain on ice for 30 minutes. 2nd Ab was added at 1 μg/100 μl/stain for 25-30 min (G@mIgG-Fc-FITC: Jackson Immunol Lab, 1 mg/ml, code#115-096-071, lot#71453, 1.0 mg/ml, used at 1 μg/stain). Cells were washed with the buffer at each step outlined above. The binding was analyzed by flow cytometry. After that, cells were stained with mouse @human IgM-PE (BD Bioscience, CA, USA, cat#555783), which is specific for B cells. The stained cells were analyzed by flow cytometry. The @human IgM positive cells were gated to analyze the binding of the fusion proteins of the invention to the B cells.

As shown in FIG. 68A, ILDR1-ECD-Fc and C1ORF32-ECD-Fc bound to B cells of all 3 donors tested. AI216611-ECD-Fc exhibited binding to B cells in 1 donor only.

In order to determine the existence of the counterpart on activated B cells, PBMCs were activated with LPS for 72 hours. Thereafter, binding with the ECDs Fc-fused proteins of the invention ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108) was done as described above, and cells were stained with mouse @human CD86-Cy5PE (BD Bioscience, CA, USA, cat#555659) and mouse @human CD19-PE (BD Bioscience, CA, USA) antibodies. The activated B cells were defined as the double positive CD19+/CD86+ population of cells.

As demonstrated in FIG. 68B, ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105) and AI216611-ECD-Fc (SEQ ID NO:104) showed binding to activated B cells.

In order to determine the existence of the counterpart in B cell malignancies, the binding of the ECDs Fc-fused proteins of the invention ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108) was analysed in B cell lymphoma cell lines. Raji (ATCC# CCL-86) and Daudi (ATCC# CCL-213) cells were purchased from ATCC and maintained in RPMI+10% FBS. The cells were stained with the B7-like proteins or controls at 10 μg/ml and thereafter with FITC-conjugated goat anti-mouse IgG Fc (Jackson Immunol Lab, NJ, USA, cat#115-096-071, lot#71453).

FIG. 68C illustrates the binding of the Fc-fused ECDs of the B7-like proteins of the invention (ILDR1-ECD-Fc (SEQ ID NO:107), C1ORF32-ECD-Fc (SEQ ID NO:105), AI216611-ECD-Fc (SEQ ID NO:104), LOC253012-ECD-Fc (SEQ ID NO:106), FXYD3-ECD-Fc (SEQ ID NO:103), and VSIG1-ECD-Fc (SEQ ID NO:108)) to the B cell lymphoma cell lines. ILDR1-ECD-Fc (SEQ ID NO:107) showed clear binding to both B cell lymphoma cell lines.

Example 16 Interaction of the ECD-Fc Fused Proteins of the Invention with Known B7

The interaction of AI216611 proteins of the invention with various known ligands of the B7 family was analyzed. Since AI216611 was predicted as a presumed CD28 receptor it was hypothesized to bind to a known B7 ligand, B7H4, which is considered orphan (its counterpart receptor has not yet been recognized).

The analysis of the interaction between B7H4-Ig (R&D Systems, Inc. cat.#4206-B7) and Fc fused AI216611 ECD (SEQ ID NO:104) was conducted using the BIAcore 3000 system (Pharmacia Biosensor, Uppsala, Sweden), which employs surface plasmon resonance for directly measuring intermolecular interactions. Fc fused AI216611 ECD (SEQ ID NO:104) (400-500 resonance units (RU)) was immobilized directly on the CM5 sensor chip. Solutions containing two different concentrations of B7H4-Ig (5 and 10 μM) were injected. As a control, the solutions were also injected onto an empty flow cell with no ligand immobilized.

Data was analyzed using BIAevaluation software (GraphPad Software Inc., San Diego Calif.). A zero baseline level was obtained by subtracting the background responses from injection of the analytes through a control flow cell with no ligand immobilized.
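The reference subtraction described above can be illustrated by a minimal sketch in which the response of the control flow cell is subtracted point by point from the response of the ligand-bearing flow cell. The sketch is not the BIAevaluation procedure itself; the function name and the response values (in RU) are hypothetical and serve only to show the arithmetic.

```python
def reference_subtract(active_ru, reference_ru):
    """Point-by-point subtraction of the control flow cell from the active flow cell.

    Both inputs are sensorgram responses (RU) sampled at the same time points.
    """
    if len(active_ru) != len(reference_ru):
        raise ValueError("sensorgrams must have the same number of points")
    return [a - r for a, r in zip(active_ru, reference_ru)]


if __name__ == "__main__":
    # Hypothetical responses, not measured data
    active = [10.0, 25.0, 40.0]      # flow cell with immobilized ligand
    reference = [10.0, 12.0, 15.0]   # empty flow cell
    print(reference_subtract(active, reference))
```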

As can be seen from FIG. 69, a slight interaction between Fc fused AI216611 ECD (SEQ ID NO:104) and B7H4 was found at 5 and 10 μM of B7H4-Ig.

Example 17 Development of Mouse Monoclonal Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32 and Anti-FXYD3 Antibodies

In order to test the expression of B7-Like proteins in different cancer tissues by immunohistochemistry, monoclonal mouse antibodies specific for Fc-fused ECDs of the proteins of invention were developed.

Development of Mouse Monoclonal Antibodies:

Four groups of the Balb/c mice (3 mice per group) were immunized with 4 Fc-fused ECDs proteins of the invention: VSIG1 (SEQ ID NO:108), LOC253012 (SEQ ID NO:106), C1ORF32 (SEQ ID NO:105) and FXYD3 (SEQ ID NO:103). The immunizations were performed 8 times at one week intervals in multiple sites, subcutaneous and intraperitoneal. Mice were bled ten days following the 4th and 8th immunizations. Serum was screened for antibody titer using a Direct ELISA protocol described below.

ELISA plates were coated with 50 μl/well of 2.5 μg/mL Fc-fused proteins (VSIG1, LOC253012, C1ORF32, FXYD3 ECDs fused to mouse IgG2 Fc, SEQ ID NOs: 108, 106, 105, 103, respectively) diluted in DPBS for 1 hour at room temperature (RT). Human IgG fused to mouse Fc region was used as a negative control. After that, plates were blocked with 300 μl/well of 1% BSA/DPBS for 15 min at RT. Following the blocking step, serially diluted sera from immunized mice and irrelevant mouse IgG were transferred to the blocked ELISA plates and incubated for 1 hour at RT. Afterwards, plates were washed 3 times with 300 μl/well washing buffer (DPBS with 0.05% Tween 20, pH 7.2-7.4). For detection, plates were incubated for 1 hour at RT with 50 μl/well of Goat anti-Mouse Kappa Light Chain Antibody at 1:1000 dilution, followed by an extensive wash (6 times with 300 μl/well of washing buffer) and incubation with the substrate. The substrate, 2,2′-Azino-bis-(3-ethylbenzthiazoline-6-sulfonic acid) (ABTS), was added at 100 μL/well and incubated for about 5 min at RT before plates were read at 414 nm using a Molecular Devices SPECTRAmax 340 PC plate reader and SOFTmax PRO software.

Serum antibody titer was defined as the dilution of serum that produces a signal that was twice that of the background.
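The titer definition above can be illustrated by the following sketch, which locates the dilution at which the signal of a serial dilution series falls to twice the background. Interpolating between the two flanking dilutions on a log10 scale is an assumption made for this sketch; the source does not state how the crossing point was determined, and the readings used in the example are hypothetical.

```python
import math


def serum_titer(dilutions, signals, background):
    """Reciprocal dilution at which the signal equals 2 x background.

    dilutions: reciprocal dilution factors in increasing order (e.g. 100, 1000).
    signals:   OD readings measured at those dilutions.
    Returns None if the signal never crosses the cutoff within the series.
    Log-linear interpolation between flanking points is an assumption.
    """
    cutoff = 2.0 * background
    points = list(zip(dilutions, signals))
    for (d_lo, s_hi), (d_hi, s_lo) in zip(points, points[1:]):
        if s_hi >= cutoff >= s_lo and s_hi != s_lo:
            frac = (s_hi - cutoff) / (s_hi - s_lo)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None


if __name__ == "__main__":
    # Hypothetical dilution series, background OD of 0.2 (cutoff 0.4)
    titer = serum_titer([100, 1000, 10000], [1.0, 0.6, 0.2], background=0.2)
    print(round(titer))
```

A return value of None indicates that the series would need to be extended (or started at a lower dilution) before a titer can be assigned.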

Results of the ELISA test of the immunized sera after 4 immunizations are summarized in Table 140. The data show that after 4 immunizations, 2 groups of mice (immunized with the LOC253012 and VSIG1 Fc-fused ECD proteins) developed antibody titers sufficient for hybridoma production.

The mice that showed the highest serum antibody titers were selected for hybridoma production. The splenocytes were fused with mouse myeloma cell line Ag8.653. The supernatants of the hybridoma clones were tested by direct ELISA (as described above) using plates coated with relevant and irrelevant coatings. The results are summarized in Table 141A and Table 141B.

The results demonstrate that production of hybridoma cell lines resulted in 14 clones specifically recognizing LOC253012 (Table 141A, bold) and 14 clones specifically recognizing VSIG1 (Table 141B, bold).
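One possible way to score clone specificity from direct ELISA readings such as those of Table 141A is sketched below. The cutoffs (OD of at least 1.0 on the relevant coating and below 0.5 on every irrelevant coating) are assumptions chosen for illustration; the clones identified in the tables were selected by the inventors, and these cutoffs are not asserted to reproduce that selection. The three example rows are taken from Table 141A.

```python
def classify_clone(od_by_coating, target, positive=1.0, negative=0.5):
    """Classify a hybridoma clone from its OD readings on each plate coating.

    Returns "specific" if only the target coating is bound, "cross-reactive" if
    the target and at least one other coating are bound, and "non-binder" if
    the target reading is below the positive cutoff. Cutoffs are illustrative.
    """
    target_od = od_by_coating[target]
    others = [od for coating, od in od_by_coating.items() if coating != target]
    if target_od < positive:
        return "non-binder"
    if all(od < negative for od in others):
        return "specific"
    return "cross-reactive"


COATINGS = ("C1ORF32", "FXYD3", "LOC253012", "VSIG1", "moFC")

# Readings for three clones from Table 141A (immunogen: LOC253012)
CLONES = {
    "2G2": (0.188, 0.214, 2.296, 0.216, 0.278),
    "3A8": (2.053, 2.450, 1.926, 1.787, 0.326),
    "16F10": (0.183, 0.179, 0.235, 0.203, 0.271),
}

if __name__ == "__main__":
    for clone, ods in CLONES.items():
        readings = dict(zip(COATINGS, ods))
        print(clone, classify_clone(readings, target="LOC253012"))
```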

For the rest of the proteins, four additional immunizations were performed in order to raise the serum antibody titers. The sera titers after the 8th immunization were tested by direct ELISA. Results are summarized in Table 142. The results demonstrate that after 8 immunizations the mice immunized with FXYD3-Fc fused ECD (SEQ ID NO:103) and C1ORF32-Fc fused ECD (SEQ ID NO:105) developed sufficient antibody titers for hybridoma production. In the next step, the best responders will be selected for hybridoma production and monoclonal antibody manufacturing.

Mouse monoclonal anti-ILDR1 and anti-AI216611 antibodies are developed similarly.

The Monoclonal Antibodies for each of the antigens of the invention (VSIG1, LOC253012, C1ORF32, FXYD3, AI216611 and ILDR1, SEQ ID NOs: 108, 106, 105, 103, 104 and 107, respectively) are used for Immunohistochemistry analysis in order to verify the expression profile of each of these putative proteins in cancer and healthy tissues.

TABLE 140 Antibody sera titers of the immunized mice after 4 immunizations.

                        ELISA Plates Coatings
Immunogen   Mouse #     C1ORF32   FXYD3     LOC253012   VSIG1     human IgG-moFc
C1ORF32     167229      378       212       <50         181       152
            167230      612       319       <50         383       159
            167231      599       445       276         934       398
FXYD3       167232      1,409     2,532     962         2,229     1,433
            167233      1,379     2,320     695         2,777     1,211
            167234      1,585     4,604     615         3,625     1,751
LOC253012   167223      <50       51        18,869      <50       68
            167224      <50       93        9,939       73        156
            167225      93        560       3,025       268       116
VSIG1       167226      <50       <50       <50         10,653    <50
            167227      158       603       <50         18,085    58
            167228      412       751       58          93,059    83

TABLE 141A Post fusional clones resulted from mouse #167223, immunized with LOC253012

            ELISA Plates Coatings (OD at 441 nm)
Clone ID    C1ORF32   FXYD3     LOC253012   VSIG1     moFC
2G2         0.188     0.214     2.296       0.216     0.278
3A8         2.053     2.450     1.926       1.787     0.326
4A8         0.201     0.225     2.553       0.222     0.279
6B10        0.227     0.206     2.335       0.227     0.293
8G10        0.476     0.346     1.562       0.267     0.487
8G11        0.192     0.200     2.178       0.220     0.274
10A3        0.189     0.190     1.654       0.215     0.272
10F2        0.246     0.247     1.720       0.242     0.350
12D5        0.198     0.190     1.619       0.224     0.291
13A4        0.252     0.221     1.847       0.228     0.312
13F11       0.219     0.194     1.865       0.223     0.296
13H2        0.216     0.229     1.404       0.255     0.300
14D11       0.199     0.230     2.183       0.225     0.294
16A10       1.285     2.130     1.239       0.972     0.277
16C10       2.159     2.516     1.927       1.908     0.273
16F10       0.183     0.179     0.235       0.203     0.271
17E5        0.188     0.193     1.943       0.214     0.270
18G4        0.202     0.209     1.843       0.216     0.284

TABLE 141B Post fusional clones resulted from mouse #167228 immunized with VSIG1

            ELISA Plates Coatings (OD at 441 nm)
Clone ID    C1ORF32   FXYD3     LOC253012   VSIG1     moFC
3F8         0.192     0.193     0.240       2.279     0.283
4D5         0.220     0.206     0.251       2.174     0.299
4D6         0.206     0.207     0.227       2.808     0.279
5B6         0.197     0.188     0.208       0.380     0.261
6G2         0.227     0.208     0.198       1.880     0.294
7C1         0.208     0.212     0.209       2.392     0.278
7E3         2.284     2.804     1.495       2.278     0.398
7H3         0.207     0.232     0.197       1.530     0.295
9F4         0.226     0.274     0.175       1.965     0.372
10B10       0.228     0.274     0.219       0.407     0.330
11B2        0.214     0.247     0.207       2.733     0.316
11F3        2.638     3.052     1.919       2.575     0.313
11G10       0.244     0.249     0.191       2.076     0.337
13A1        0.240     0.239     0.195       2.469     0.316
13H5        2.782     2.886     1.963       2.305     0.293
14D8        0.218     0.230     0.187       2.660     0.294
15E8        0.211     0.240     0.180       1.966     0.293
17E6        0.602     0.547     0.271       0.341     0.662
19G6        0.439     0.490     0.245       1.279     0.318

TABLE 142 Antibody sera titers of the immunized mice after 8 immunizations.

                        ELISA Plates Coatings
Immunogen   Mouse #     C1ORF32   FXYD3     LOC253012   VSIG1     human IgG-moFc
C1ORF32     167229      21,760    2,192     295         300       2,287
            167230      69,543    613       59          265       693
            167231      23,100    1,952     756         1,645     2,487
FXYD3       167232      3,237     58,240    1,564       2,622     6,515
            167233      5,061     10,786    2,125       4,959     7,664
            167234      3,445     122,929   811         2,267     7,061
LOC253012   167223      Fused
            167224      2,641     18,011    328,050     5,491     260
            167225      6,132     23,589    173,452     7,662     90
VSIG1       167226      819       6,096     207         471,316   89
            167227      39,852    102,238   18,463      532,487   256
            167228      Fused

Immunohistochemical Analysis

Immunohistochemistry enables the visualization (using light or confocal microscopy) of the tissue distribution of specific antigens (or epitopes). The process localizes protein targets of interest by applying specific monoclonal or polyclonal antibodies to tissue surfaces in a process called antibody incubation.

This method involves detection of a substrate in situ in fixed cells by substrate specific antibodies. The substrate specific antibodies may be enzyme linked or linked to fluorophores. Detection is carried out by microscopy and subjective evaluation. If enzyme linked antibodies are employed, a colorimetric reaction may be required.

The immunohistochemical analysis performed for the antigens of the invention (VSIG1, LOC253012, C1ORF32, FXYD3, AI216611 and ILDR1, SEQ ID NOs: 108, 106, 105, 103, 104 and 107, respectively) consists of two phases:

Phase I: Antibody calibration: A dilution series of each of the antibodies developed against the specific protein antigens is run using selected formalin-fixed paraffin-embedded (FFPE) control tissues and cell lines. The best performing antibody is selected for Phase II.

Phase II: Protein distribution and localization analysis: Using the optimal antibody concentration selected in Phase I, the distribution and localization of VSIG1, LOC253012, C1ORF32, FXYD3, AI216611 and ILDR1 proteins are analyzed in Tissue Arrays consisting of cancer and healthy tissues, looking for differential expression of the proteins in some of the cancer samples, as compared with healthy samples.

Example 17 Development of Fully Human Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32 and Anti-FXYD3 Antibodies

Generation Of Human Monoclonal Antibodies Against VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 Antigen

Fusion proteins composed of the extracellular domain of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 linked to a mouse IgG2 Fc polypeptide are generated by standard recombinant methods and used as antigen for immunization.

Transgenic HuMab Mouse.

Fully human monoclonal antibodies to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 are prepared using mice from the HCo7 strain of the transgenic HuMab Mouse®, which expresses human antibody genes. In this mouse strain, the endogenous mouse kappa light chain gene has been homozygously disrupted as described in Chen et al. (1993) EMBO J. 12:811-820 and the endogenous mouse heavy chain gene has been homozygously disrupted as described in Example 1 of PCT Publication WO 01/09187. Furthermore, this mouse strain carries a human kappa light chain transgene, KCo5, as described in Fishwild et al. (1996) Nature Biotechnology 14:845-851, and a human heavy chain transgene, HCo7, as described in U.S. Pat. Nos. 5,545,806; 5,625,825; and 5,545,807.

HuMab Immunizations:

To generate fully human monoclonal antibodies to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3, mice of the HCo7 HuMab Mouse® strain can be immunized with purified recombinant VSIG1 fusion protein derived from mammalian cells that are transfected with an expression vector containing the gene encoding the fusion protein. General immunization schemes for the HuMab Mouse® are described in Lonberg, N. et al (1994) Nature 368(6474): 856-859; Fishwild, D. et al. (1996) Nature Biotechnology 14: 845-851 and PCT Publication WO 98/24884. The mice are 6-16 weeks of age upon the first infusion of antigen. A purified recombinant VSIG1 antigen preparation (5-50 μg, purified from transfected mammalian cells expressing VSIG1 fusion protein) is used to immunize the HuMab mice intraperitoneally.

Transgenic mice are immunized twice with antigen in complete Freund's adjuvant or Ribi adjuvant IP, followed by 3-21 days IP (up to a total of 11 immunizations) with the antigen in incomplete Freund's or Ribi adjuvant. The immune response is monitored by retroorbital bleeds. The plasma is screened by ELISA (as described below), and mice with sufficient titers of anti-VSIG1 human immunoglobulin are used for fusions. Mice are boosted intravenously with antigen 3 days before sacrifice and removal of the spleen.

Selection of HuMab Mice™ Producing Anti-VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 and FXYD3 Antibodies:

To select HuMab Mice™ producing antibodies that bind VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3, sera from immunized mice are tested by a modified ELISA as originally described by Fishwild, D. et al. (1996). Briefly, microtiter plates are coated with purified recombinant VSIG1 fusion protein at 1-2 μg/ml in PBS, 50 μl/well, incubated at 4° C. overnight, then blocked with 200 μl/well of 5% BSA in PBS. Dilutions of plasma from VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3-immunized mice are added to each well and incubated for 1-2 hours at ambient temperature. The plates are washed with PBS/Tween and then incubated with a goat-anti-human kappa light chain polyclonal antibody conjugated with alkaline phosphatase for 1 hour at room temperature. After washing, the plates are developed with pNPP substrate and analyzed by spectrophotometer at OD 415-650. Mice that developed the highest titers of anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies are used for fusions. Fusions are performed as described below and hybridoma supernatants are tested for anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 activity by ELISA.

Generation of Hybridomas Producing Human Monoclonal Antibodies to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3

The mouse splenocytes, isolated from the HuMab mice, are fused with PEG to a mouse myeloma cell line based upon standard protocols. The resulting hybridomas are then screened for the production of antigen-specific antibodies. Single cell suspensions of splenic lymphocytes from immunized mice are fused to one-fourth the number of P3X63-Ag8.653 (ATCC CRL 1580) nonsecreting mouse myeloma cells with 50% PEG (Sigma). Cells are plated at approximately 1×10^5/well in flat bottom microtiter plates, followed by about two weeks of incubation in selective medium containing 10% fetal calf serum, supplemented with Origen (IGEN) in RPMI, L-glutamine, sodium pyruvate, HEPES, penicillin, streptomycin, gentamicin, 1×HAT, and beta-mercaptoethanol. After 1-2 weeks, cells are cultured in medium in which the HAT is replaced with HT. Individual wells are then screened by ELISA (described above) for human anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 monoclonal IgG antibodies. Once extensive hybridoma growth occurs, the medium is monitored, usually after 10-14 days. The antibody-secreting hybridomas are replated and screened again and, if still positive for human IgG, the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 monoclonal antibodies are subcloned at least twice by limiting dilution. The stable subclones are then cultured in vitro to generate small amounts of antibody in tissue culture medium for further characterization.
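
The cell numbers implied by the fusion and subcloning steps above can be worked through with a short Python sketch. It rests on assumptions that are not stated in the text: that the plating density of about 1×10^5 cells/well refers to the total number of fused cells, that 96-well plates are used, and that the number of cells seeded per well during limiting dilution follows a Poisson distribution. The splenocyte count and the seeding density of 0.3 cells/well are hypothetical inputs.

```python
import math
from typing import Dict


def fusion_plan(splenocytes: int, cells_per_well: int = 100_000,
                wells_per_plate: int = 96) -> Dict[str, int]:
    """Estimate myeloma cells, wells and plates needed for one fusion.

    Myeloma cells are taken as one-fourth the number of splenocytes,
    as in the protocol; plate format and density basis are assumptions.
    """
    myeloma = splenocytes // 4
    total = splenocytes + myeloma
    wells = -(-total // cells_per_well)      # integer ceiling division
    plates = -(-wells // wells_per_plate)
    return {"myeloma": myeloma, "total": total, "wells": wells, "plates": plates}


def monoclonal_fraction(mean_cells_per_well: float) -> float:
    """Fraction of growing wells expected to hold exactly one founder cell.

    Poisson model: P(k = 1) / P(k >= 1) with mean `mean_cells_per_well`.
    """
    lam = mean_cells_per_well
    p_one = lam * math.exp(-lam)
    p_any = 1.0 - math.exp(-lam)
    return p_one / p_any


plan = fusion_plan(splenocytes=100_000_000)   # hypothetical spleen yield
clonality = monoclonal_fraction(0.3)          # hypothetical seeding density
```

Under the Poisson assumption, lowering the seeding density raises the expected fraction of single-founder wells, which is one reason subcloning is repeated at least twice.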

Hybridoma clones are selected for further analysis.

Structural Characterization of Desired Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32 or Anti-FXYD3 Human Monoclonal Antibodies

The cDNA sequences encoding the heavy and light chain variable regions of the obtained anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 monoclonal antibodies are obtained from the resultant hybridomas, respectively, using standard PCR techniques and are sequenced using standard DNA sequencing techniques.

The nucleotide and amino acid sequences of the heavy chain variable region and of the light chain variable region are identified. These sequences may be compared to known human germline immunoglobulin light and heavy chain sequences, and the CDRs of each heavy and light chain of the obtained anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 sequences are identified.
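
The comparison to germline sequences mentioned above can be illustrated with a minimal percent-identity calculation. The sketch assumes the sequences have already been aligned to equal length (with "-" marking gaps); the germline names and the short sequences are hypothetical placeholders, not real immunoglobulin germline entries, and a real analysis would use a dedicated alignment tool.

```python
from typing import Dict, Tuple


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned, non-gap positions of two sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(1 for x, y in compared if x == y)
    return 100.0 * matches / len(compared)


def closest_germline(query: str, germlines: Dict[str, str]) -> Tuple[str, float]:
    """Return the (name, percent identity) of the best-matching germline."""
    scored = ((name, percent_identity(query, seq)) for name, seq in germlines.items())
    return max(scored, key=lambda pair: pair[1])


# Hypothetical ten-residue fragments used only to exercise the functions.
toy_germlines = {
    "germline_1": "EVQLVESGGG",
    "germline_2": "QVQLVQSGAE",
}
toy_query = "EVQLVESGGA"

best_name, best_identity = closest_germline(toy_query, toy_germlines)
```

Positions where the query differs from its closest germline are the candidates for somatic mutation.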

Characterization of Binding Specificity and Binding Kinetics of Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32 or Anti-FXYD3 Human Monoclonal Antibodies

The binding affinity, binding kinetics, binding specificity, and cross-competition of anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies are examined by Biacore analysis. Also, binding specificity is examined by flow cytometry.

Binding Affinity and Kinetics

Anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies produced according to the invention are characterized for affinities and binding kinetics by Biacore analysis (Biacore AB, Uppsala, Sweden). Purified recombinant human VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 fusion protein is covalently linked to a CM5 chip (carboxymethyl dextran coated chip) via primary amines, using standard amine coupling chemistry and the kit provided by Biacore. Binding is measured by flowing the antibodies in HBS-EP buffer (provided by Biacore AB) at a concentration of 267 nM at a flow rate of 50 μl/min. The antigen-antibody association kinetics are followed for 3 minutes and the dissociation kinetics are followed for 7 minutes. The association and dissociation curves are fit to a 1:1 Langmuir binding model using BIAevaluation software (Biacore AB). To minimize the effects of avidity in the estimation of the binding constants, only the initial segments of data corresponding to the association and dissociation phases are used for fitting.
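
For orientation, the 1:1 Langmuir model referred to above gives closed-form curves for both phases: during association R(t) = Req·(1 − exp(−(ka·C + kd)·t)) with Req = Rmax·ka·C/(ka·C + kd), and during dissociation R(t) = R0·exp(−kd·t). The sketch below simulates noise-free curves and recovers the rate constants by simple log-linear regression over the initial segment of each phase. It is not the BIAevaluation algorithm, which performs nonlinear curve fitting. The analyte concentration (267 nM) and phase lengths (3 and 7 minutes) come from the text; the rate constants, Rmax, the 1-second sampling and the 60-second fitting window are hypothetical assumptions.

```python
import math
from typing import Sequence


def association(t: float, conc: float, ka: float, kd: float, rmax: float) -> float:
    """Response during association for a 1:1 Langmuir interaction."""
    kobs = ka * conc + kd
    req = rmax * ka * conc / kobs
    return req * (1.0 - math.exp(-kobs * t))


def dissociation(t: float, r0: float, kd: float) -> float:
    """Response during dissociation, starting from response r0 at t = 0."""
    return r0 * math.exp(-kd * t)


def decay_rate(times: Sequence[float], values: Sequence[float]) -> float:
    """Least-squares slope of ln(values) versus time, returned as a positive rate."""
    logs = [math.log(v) for v in values]
    n = len(times)
    mean_t = sum(times) / n
    mean_l = sum(logs) / n
    num = sum((t - mean_t) * (l - mean_l) for t, l in zip(times, logs))
    den = sum((t - mean_t) ** 2 for t in times)
    return -num / den


CONC = 267e-9                 # M, antibody concentration from the text
KA_TRUE, KD_TRUE = 1e5, 1e-3  # assumed rate constants (1/(M*s), 1/s)
RMAX = 100.0                  # assumed maximal response in RU

assoc_t = [float(t) for t in range(0, 181)]    # 3 minutes, 1 s sampling
assoc_r = [association(t, CONC, KA_TRUE, KD_TRUE, RMAX) for t in assoc_t]
dissoc_t = [float(t) for t in range(0, 421)]   # 7 minutes, 1 s sampling
dissoc_r = [dissociation(t, assoc_r[-1], KD_TRUE) for t in dissoc_t]

WINDOW = 60  # assumed length (s) of the "initial segment" used for fitting

# Dissociation: ln R falls linearly with slope -kd.
kd_est = decay_rate(dissoc_t[:WINDOW], dissoc_r[:WINDOW])

# Association: successive increments R(t+1) - R(t) decay with rate kobs.
increments = [assoc_r[i + 1] - assoc_r[i] for i in range(WINDOW)]
kobs_est = decay_rate(assoc_t[:WINDOW], increments)

ka_est = (kobs_est - kd_est) / CONC   # from kobs = ka*C + kd
KD_est = kd_est / ka_est              # equilibrium dissociation constant (M)
```

With real sensorgrams the data are noisy and the log-linear shortcut becomes unreliable, so the sketch is best read as a check on the model equations rather than as a fitting procedure.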

Epitope Mapping of Obtained Anti-VSIG1, Anti-ILDR1, Anti-LOC253012, Anti-AI216611, Anti-C1ORF32 or Anti-FXYD3 Antibodies

Biacore is used to determine epitope grouping of anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 HuMAbs. Obtained anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies are used to map their epitopes on the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen, respectively. These different antibodies are coated on three different surfaces of the same chip to 8000 RUs each. Dilutions of each of the mAbs are made, starting at 10 μg/mL, and are incubated with Fc-fused VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 (50 nM) for one hour. The incubated complex is injected over all three surfaces (and a blank surface) at the same time for 1.5 minutes at a flow rate of 20 μL/min. The signal from each surface at the end of 1.5 minutes, after subtraction of appropriate blanks, is plotted against the concentration of mAb in the complex. Upon analysis of the data, the anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 antibodies are categorized into different epitope groups depending on the epitope mapping results. The functional properties thereof are also compared.
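
One way to turn competition data of the kind described above into epitope groups is sketched below. The approach, the threshold and the data are assumptions made for illustration: each entry is the fraction of antigen binding that remains on a surface-coated antibody when the antigen was pre-incubated with the highest concentration of a solution antibody, a pair is treated as competing when that fraction falls below 0.5, and competing antibodies are merged transitively with a union-find structure. The antibody names and values are hypothetical.

```python
from typing import Dict, List, Tuple


def epitope_groups(residual: Dict[Tuple[str, str], float],
                   antibodies: List[str],
                   threshold: float = 0.5) -> List[List[str]]:
    """Group antibodies whose pairwise residual binding is below `threshold`.

    `residual[(surface_ab, solution_ab)]` is the fraction of signal left on
    the surface antibody after pre-incubation with the solution antibody.
    """
    parent = {ab: ab for ab in antibodies}

    def find(x: str) -> str:
        # Follow parent links to the group representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (surface_ab, solution_ab), fraction in residual.items():
        if surface_ab != solution_ab and fraction < threshold:
            parent[find(surface_ab)] = find(solution_ab)

    groups: Dict[str, List[str]] = {}
    for ab in antibodies:
        groups.setdefault(find(ab), []).append(ab)
    return sorted(sorted(members) for members in groups.values())


# Made-up competition results for three hypothetical antibodies.
toy_residual = {
    ("mAb1", "mAb2"): 0.10, ("mAb2", "mAb1"): 0.15,
    ("mAb1", "mAb3"): 0.95, ("mAb3", "mAb1"): 0.90,
    ("mAb2", "mAb3"): 0.92, ("mAb3", "mAb2"): 0.97,
}

groups = epitope_groups(toy_residual, ["mAb1", "mAb2", "mAb3"])
```

Transitive merging is a simplification: two antibodies that each compete with a third but not with each other would still land in one group, so borderline cases call for inspection of the full titration curves.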

Chinese hamster ovary (CHO) cell lines that express VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 protein at the cell surface are developed and used to determine the specificity of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 HuMAbs by flow cytometry. CHO cells are transfected with expression plasmids containing full length cDNA encoding transmembrane forms of VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen or a variant thereof. The transfected proteins contain an epitope tag at the N-terminus, which is used for detection by an antibody specific for the epitope. Binding of an anti-VSIG1, anti-ILDR1, anti-LOC253012, anti-AI216611, anti-C1ORF32 or anti-FXYD3 MAb is assessed by incubating the transfected cells with each of the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 Abs at a concentration of 10 μg/ml. The cells are washed and binding is detected with a FITC-labeled anti-human IgG Ab. A murine anti-epitope tag Ab, followed by labeled anti-murine IgG, is used as the positive control. Non-specific human and murine Abs are used as negative controls. The obtained data are used to assess the specificity of the HuMAbs for the VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen target.

These antibodies and other antibodies specific to VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 may be used in the afore-described anti-VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 related therapies such as treatment of cancers wherein VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen is differentially expressed such as lung cancer, colon cancer and ovarian cancer and/or for modulating (enhancing or inhibiting) B7 immune co-stimulation involving the VSIG1, ILDR1, LOC253012, AI216611 or C1ORF32 antigen such as in the treatment of cancers and autoimmune diseases wherein such antibodies will e.g., prevent negative stimulation of T cell activity against desired target cancer cells or prevent the positive stimulation of T cell activity thereby eliciting a desired anti-autoimmune effect.

The invention has been described and prophetic embodiments provided relating to manufacture and selection of desired anti-VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antibodies for use as therapeutics and diagnostic methods wherein the disease or condition is associated with VSIG1, ILDR1, LOC253012, AI216611, C1ORF32 or FXYD3 antigen. The invention is now further described by the claims which follow. 

What is claimed is:
 1. An isolated polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299.
 2. An isolated polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299.
 3. An isolated polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299 and a functional group that improves stability, penetration through cellular membranes, penetration through tissue barriers, tissue localization, efficacy, or causes decreased clearance, decreased toxicity, improved selectivity, or improved resistance to expulsion by cellular pumps.
 4. The isolated polypeptide of claim 3, having at least one peptidomimetic organic moiety.
 5. An isolated polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, wherein the polypeptide has been chemically modified, and wherein the chemical modification is selected from acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, pegylation, prenylation, phosphorylation, and ubiquitination.
 6. A fusion protein comprising an isolated polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299; the polypeptide of amino acid sequence SEQ ID NO: 299 and a functional group that improves stability, penetration through cellular membranes, penetration through tissue barriers, tissue localization, efficacy, or causes decreased clearance, decreased toxicity, improved selectivity, or improved resistance to expulsion by cellular pumps; or the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, wherein the polypeptide has been chemically modified, and wherein the chemical modification is selected from acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, pegylation, prenylation, phosphorylation, and ubiquitination; joined to a heterologous polypeptide.
 7. The fusion protein of claim 6, comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO: 299 joined to a heterologous polypeptide.
 8. The fusion protein of claim 6, wherein the heterologous sequence comprises a VASP domain.
 9. The fusion protein of claim 6, wherein the heterologous sequence comprises at least a portion of an immunoglobulin molecule.
 10. The fusion protein of claim 9, wherein said at least a portion of said immunoglobulin molecule comprises an immunoglobulin heavy chain constant region.
 11. The fusion protein of claim 10, wherein the immunoglobulin heavy chain constant region is an Fc fragment.
 12. The fusion protein of claim 11, wherein the immunoglobulin heavy chain constant region is an isotype selected from the group consisting of an IgG1, IgG2, IgG3, IgG4, IgM, IgE, IgA and IgD.
 13. The fusion protein of claim 6, wherein the fusion protein modulates lymphocyte activation in vitro or in vivo.
 14. A method of producing a fusion protein according to claim 6 or a protein comprising at least one of the polypeptide of amino acid sequence SEQ ID NO: 299; the polypeptide of amino acid sequence SEQ ID NO: 299 and a functional group that improves stability, penetration through cellular membranes, penetration through tissue barriers, tissue localization, efficacy, or causes decreased clearance, decreased toxicity, improved selectivity, or improved resistance to expulsion by cellular pumps; or the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, wherein the polypeptide has been modified, and wherein the modification is selected from acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, prenylation, phosphorylation, and ubiquitination; the method comprising introducing a vector comprising a polynucleotide encoding for said protein to a recombinant cell; and producing the protein with the recombinant cell.
 15. The method of claim 14, wherein the recombinant cell is a prokaryotic or eukaryotic cell.
 16. The method of claim 15, wherein the recombinant cell is a mammalian cell, yeast cell, insect cell or bacterial cell.
 17. The method of claim 16, wherein the recombinant cell is a CHO cell, a COS cell, a 293 cell, or an E. coli cell.
 18. A pharmaceutical composition comprising the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, and a pharmaceutically acceptable diluent or carrier.
 19. A pharmaceutical composition comprising a fusion polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299 and a heterologous amino acid sequence, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299 and a heterologous amino acid sequence, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299 and a heterologous amino acid sequence; or the polypeptide of amino acid sequence SEQ ID NO: 299 and a functional group that improves stability, penetration through cellular membranes, penetration through tissue barriers, tissue localization, efficacy, or causes decreased clearance, decreased toxicity, improved selectivity, or improved resistance to expulsion by cellular pumps and a heterologous amino acid sequence; or the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, wherein the polypeptide has been chemically modified, and wherein the chemical modification is selected from acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, pegylation, prenylation, phosphorylation, and ubiquitination, and a heterologous amino acid sequence; and a pharmaceutically acceptable diluent or carrier.
 20. A method for treating an autoimmune disease or preventing transplant rejection or graft versus host disease in a subject in need thereof comprising administering to the subject a therapeutically effective amount of any of a polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, or the pharmaceutical composition of claim 18.
 21. A method for treating an autoimmune disease or preventing transplant rejection or graft versus host disease in a subject in need thereof comprising administering to the subject a therapeutically effective amount of any of a fusion polypeptide comprising the polypeptide of amino acid sequence SEQ ID NO: 299 and a heterologous amino acid sequence, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299 and a heterologous amino acid sequence, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299 and a heterologous amino acid sequence; or the polypeptide of amino acid sequence SEQ ID NO: 299 and a functional group that improves stability, penetration through cellular membranes, penetration through tissue barriers, tissue localization, efficacy, or causes decreased clearance, decreased toxicity, improved selectivity, or improved resistance to expulsion by cellular pumps and a heterologous amino acid sequence; or the polypeptide of amino acid sequence SEQ ID NO: 299, a polypeptide comprising amino acid residues 21-184 of SEQ ID NO: 299, or a polypeptide comprising a sequence of amino acid residues having at least 95% sequence identity with amino acid residues 21-184 of SEQ ID NO:299, wherein the polypeptide has been chemically modified, and wherein the chemical modification is selected from acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor formation, covalent attachment of a lipid or lipid derivative, methylation, myristylation, pegylation, prenylation, phosphorylation, and ubiquitination, and a heterologous amino acid sequence; or the pharmaceutical composition of claim 19.
 22. The method of claim 20, further comprising administering a moiety useful for treating an autoimmune disease, transplant rejection or graft versus host disease.
 23. The method of claim 22, wherein the moiety is a cytokine antibody, cytokine receptor antibody, drug, or another immunomodulatory agent.
 24. The method of claim 23, wherein the autoimmune disease is selected from a group consisting of multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; Crohn's disease, ulcerative colitis; benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.
 25. The method of claim 21, further comprising administering a moiety useful for treating an autoimmune disease, transplant rejection or graft versus host disease.
 26. The method of claim 25, wherein the moiety is a cytokine antibody, cytokine receptor antibody, drug, or another immunomodulatory agent.
 27. The method of claim 26, wherein the autoimmune disease is selected from a group consisting of multiple sclerosis; psoriasis; rheumatoid arthritis; systemic lupus erythematosus; Crohn's disease, ulcerative colitis; benign lymphocytic angiitis, lupus erythematosus, Hashimoto's thyroiditis, primary myxedema, Graves' disease, pernicious anemia, autoimmune atrophic gastritis, Addison's disease, insulin dependent diabetes mellitus, Goodpasture's syndrome, myasthenia gravis, pemphigus, sympathetic ophthalmia, autoimmune uveitis, autoimmune hemolytic anemia, idiopathic thrombocytopenia, primary biliary cirrhosis, chronic active hepatitis, ulcerative colitis, Sjogren's syndrome, rheumatic disease, polymyositis, scleroderma, mixed connective tissue disease, inflammatory rheumatism, degenerative rheumatism, extra-articular rheumatism, collagen diseases, chronic polyarthritis, psoriasis arthropathica, ankylosing spondylitis, juvenile rheumatoid arthritis, periarthritis humeroscapularis, panarteriitis nodosa, progressive systemic scleroderma, arthritis uratica, dermatomyositis, muscular rheumatism, myositis, myogelosis and chondrocalcinosis.